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SINGLE CHIP VLSI IMPLEMENTATION OF A DIGITAL RECEIVER EMPLOYING 
ORTHOGONAL FREQUENCY DIVISION MULTIPLEXING 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention. 

This invention relates to receivers of electromagnetic signals employing 
multicarrier modulation. More particularly this invention relates to a digital receiver which 
is implemented on a single VLSI chip for receiving transmissions employing orthogonal 
frequency division multiplexing/and which is suitable for the reception of digital video 

broadcasts. 

2. Description of the Related Art. 

Coded orthogonal frequency division multiplexing ("COFDM") has been proposed 
for digital audio and digital video broadcasting, both of which require efficient use of 
limited bandwidth, and a method of transmission which is reliable in the face of several 
effects. For example the impulse response of a typical channel can be modeled as the 
sum of a plurality of Dirac pulses having different delays. Each pulse is subject to a 
multiplication factor, in which the amplitude generally follows a Rayleigh law. Such a 
pulse train can extend over several microseconds, making unencoded transmission at 
high bit rates unreliable. In addition to random noise, impulse noise, and fading, other 
major difficulties in digital terrestrial transmissions at high data rates include multipath 
propagation, and adjacent channel interference, where the nearby frequencies have 
highly correlated signal variations: COFDM is particularly suitable for these applications. 
In practical COFDM arrangements, relatively small amounts of data are modulated onto 
each of a large number of carriers that are closely spaced in frequency. The duration 
of a data symbol is increased in the same ratio as the number of carriers or 
subchannels, so that inter-symbol interference is markedly reduced. 

Multiplexing according to COFDM is illustrated in Figs. 1 and 2, wherein the 
spectrum of a single COFDM carrier or subchannel is indicated by line 2. A set of carrier 
frequencies is indicated by the superimposed waveforms in Fig. 2, where orthogonality 
conditions are satisfied. In general two real-valued functions are orthogonal if 
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|Y p (tK(t)dt = K 
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where K is a constant, and K = 0 if p * q; K *0 if p = q. Practical encoding and decoding 
of signals according to COFDM relies heavily on the fast Fourier transform ("FFT"), as 
can be appreciated from the following equations. 
The signal of a carrier c is given by 

s c (t) = A c (t)e i(w < , **< (,)l (2) 

where A c is the data at time t, co c is the frequency of the carrier, and <$> c is the phase. N 
carriers in the COFDM signal is given by 



10 s s (t) .= (1/N) £ A n (X)e l t< *» , -*" (,)l ( 3 ) 

n = 0 



20 



w n = w o + "Aw (4) 
. iSampling over one symbol period, then 

^(^i (5) 

AcW^An (6) 
<With a sampling frequency of 1/T, the resulting signal is represented by 

- n - 0 

Sampling over the period of one data symbol t = NT, with co 0 = 0, , 

N-1 f 

s s (kT) = (1/N). £ A n e j: Sj< nAa) > kT (8) . 

n = o 

which compares with the general form of the inverse discrete Fourier transform: 

N-1 

30 . g(kT) = (l/N) £ GCn/CkTJJe^^ (9) 



In the above equations A,^,, is the input signal in the sampled frequency domain, and 
s s (kT) is the time domain representation. It is known that increasing the size of the FFT 
provides, longer symbol durations and improves ruggedness of the system as regards 
35 echoes which exceed the length of the guard interval. However computational 
complexity increases according to Nlog 2 N, and is a practical limitation. 



In the presence of intersymbol interference caused by the transmission channel, 
orthogonality between the signals is not maintained. One approach to this problem has 
been to deliberately sacrifice some of the emitted energy by preceding each symbol in 
the time domain by an interval which exceeds the memory of the channel, and any 
multipath delay. The "guard interval" so chosen is large enough to absorb any 
intersymbol interference, and is established by preceding each symbol by a replication 
of a portion of itself. The replication is typically a cyclic extension of the terminal portion 
of the symbol. Referring to Fig. 3, a data symbol 4 has an active interval 6 which 
contains all the data transmitted in the symbol. The terminal portion 8 of the active / 
interval 6 is repeated at the beginning of the symbol as the guard interval 10. The 
COFDM signal is represented by the solid line 12. It is possible to cyclically repeat the 
initial portion of the active interval 6 at the end of the symbol. 

Transmission of COFDM data can be accomplished according to the known 
general scheme shown in Fig. 4. A serial data stream 14 is converted to a series of 
parallel streams 16 in a serial-to-parallel converter 18. Each of the parallel streams 16 
is grouped into x bits each to form a complex number, where x determines the signal 
constellation of its associated parallel stream. After outer coding and interleaving in 
block 20 pilot carriers are inserted via a signal mapper 22 for use in synchronization and 
channel estimation in the receiver. The pilot carriers are typically of two types. Continual 
pilot carriers are transmitted in the same location in each symbol, with the same phase 
and amplitude. In the receiver, these are utilized for phase noise cancellation, automatic 
frequency control, and time/sampling synchronization. Scattered pilot carriers are 
distributed throughout the symbol, and their location typically changes from symbol to 
symbol. They are primarily useful in channel estimation. Next the complex numbers are 
modulated at baseband by the inverse fast fourier transform ("IFFT") in block 24. A 
guard interval is then inserted at block 26. The discrete symbols are then converted to 
analog, typically low-pass filtered, and then upconverted to radiofrequency in block 28. 
The signal is then transmitted through a channel 30 and received in a receiver 32. As 
is well known in the art, the receiver applies an inverse of the transmission process to 
obtain the transmitted information. In particular an FFT is applied to demodulate the 
signal. 

~ - A modem application of COFDM has been proposed in the European Telecommu- 
nications Standard ETS 300 744 (March 1997), which specifies the framing structure, 
channel coding, and modulation for digital terrestrial television, The specification was 
designed to accommodate digital terrestrial television within the. existing spectrum 
allocation for analog transmissions, yet provide adequate protection against high levels 
of co-channel interference and adjacent channel interference. A flexible guard interval 



is specified, so that the system can support diverse network configurations, while 
maintaining high spectral efficiency, and sufficient protection against co-channel 
interference and adjacent channel interference from existing PAL/SECAM sen/ices. The 
noted European Telecommunications Standard defines two modes of operation. A "2K 
mode", suitable for single transmitter operation and for small single frequency networks 
with limited transmitter distances. An "8K mode" can be used for either single transmitter 
operation or for large single frequency networks. Various levels of quadrature amplitude 
modulation ("QAM") are supported, as are different inner code rates, in order to balance 
bit rate against ruggedness. The system is intended to accommodate a transport layer 
according to the.Moving Picture Experts Group ("MPEG"), and is directly compatible with 
M P E G - 2 W;c oded TV ^signals (ISO/hE C~ 13818). 

In the noted European Telecommunications Standard data carriers in a COFDM 
frame can be either quadrature phase shift keyed ("QPSK"), 16-QAM, 64-QAM, non- 
uniform 16-QAM, or non-uniform 64-QAM using Gray mapping. 

An important problem in the reception of COFDM transmission is difficulty in 
maintaining synchronization due to phase noise and jitter which arise from upconversion 
prior to transmission, downconversion in the receiver, and the front end oscillator? in the 
tuner, which is typically a voltage controlled oscillator. Except for provision" of pilot 
carriers to aid in synchronization during demodulation, these issues are not specifically 
addressed in the noted European Telecommunications Standard, but are left for the 
implementer to solve. 

Basically phase disturbances are of two types. First, noisy components which 
disturb neighbor carriers in a multicarrier system are called the "foreign noise contribu- 
tion" ("FNC"). Second, a noisy component which .disturbs its own carrier is referred to 
as the "own noise contribution". t 

Referring to Fig. 5, the position of ideal constellation samples are indicated by "x" 
symbols 34. The effect of foreign noise contribution is stochastic, resulting in Gaussian- 
like noise. Samples perturbed in this manner are indicated on Fig. 5 as circles 36. The 
effects of the own noise contribution is a common rotation of all constellation points, 
indicated as a displacement between each u x" symbol 34 and its associated circle 36. 
This is referred to as the "common phase error", which notably changes from symbol to 
symbol, and must therefore be recalculated each symbol period T s . The common phase 
error may also be interpreted as a mean phase deviation during the symbol period T s . 

In order for the receiver 32 to process the data symbols in a practical system, a 
mathematical operation is performed on the complex signal representing each data 
symbol. Generally this is an FFT. For valid results to be obtained, a particular form of 



timing synchronization is required in order to align the FFT interval witrvthe received 
data symbol. 

SUMMARY OF THE INVENTION 

It is therefore a primary object of the invention to provide a highly integrated, low 
cost apparatus for the reception of digital broadcasts, such as terrestrial digital video 
broadcasts, which is implemented on a single VLSI chip. 

It is another object of the invention to provide an improved method and apparatus 
for synchronizing a received data symbol with an FFT window in signals transmitted 

according to COFDM. 

It is yet another object of the invention to improve the stability of digital multicarrier 

receivers in respect of channel estimation. 

: "lt is still another object of the invention to improve the automatic frequency control 
circuitry employed in multicarrier digital receivers. 

It is a further object of the invention to improve the automatic sampling rate, control 
circuitry employed in multicarrier digital receivers. 

The invention provides a digital receiver for multicarrier signals that are transmitted 
by orthogonal frequency division multiplexing. The multicarrier signal carries a stream 
of data symbols having an active interval, and a guard interval in which the guard., 
interval is a replication of a portion of the active interval. In the receiver an analog to 
digital converter is coupled to a front end amplifier. An l/Q demodulator is provided for 
recovering in phase and quadrature components from data sampled by the analog to 
digital converter, and an automatic gain control circuit is coupled to the analog to digital 
converter. In a low pass filter circuit accepting I and Q data from the^l/Q demodulator, 
the I and Q data are decimated and provided to a resampling circuit. An interpolator in 
the resampling circuit accepts the decimated I and Q data at a first rate and outputs 
resampled I and Q data at a second rate. An FFT window synchronization circuit is 
coupled to the resampling circuit for locating a boundary of the guard interval. A real- 
time pipelined FFT processor is operationally associated with the FFT window 
synchronization circuit. Each stage of the FFT processor has a complex coefficient 
multiplier, and an associated memory with a lookup table defined therein for multipli- 
cands being multiplied in the complex coefficient multiplier. Each multiplicand in the 
lookup table is unique in value. A monitor circuit responsive to the FFT window 
synchronization circuit detects a predetermined indication that a boundary between an 
active symbol and a guard interval has been located. 

According to an aspect of the invention the FFT window synchronization circuit has 
a first, delay element accepting currently arriving resampled I and Q data, and outputting 
delayed resampled I and Q data. A subtracter produces a signal representative of the 



difference between the currently arriving resampled I and Q data and the delayed 
resampled I and Q data. In a first circuit the subtracter output signal is converted to a 
signal having a unipolar magnitude, which is preferably the absolute value of the signal 
provided by the subtracter. A second delay element stores the output signal of the first 
5 circuit, and a third delay element receives the delayed output of the second delay 
element. In a second circuit a statistical relationship is calculated between data stored 
' in the second delay element and data stored in the third delay element., The output of 
the FFT window synchronization circuit is representative of the statistical relationship. 
Preferably the statistical relationship is the F ratio. The FFT processor is capable of 

10 operation in a 2K mode and in an 8K mode. 

The FFT processor has an address generator for the memory oteach stage, which 
accepts a signal representing the order dependency of a currently required multiplicand, 
and generates an address of the memory wherein the currently required multiplicand is 
stored. In a further aspect of the invention each multiplicand is stored in the lookup table 

1 5 in order of its respective order dependency for multiplication by the complex coefficient 
.multiplier, so that the order dependencies of the multiplicands define an incrementation 
sequence. The address generator has an accumulator for storing a previous address 
.that was generated thereby, a circuit for calculating an incrementation value of the 
currently required multiplicand responsive to the incrementation sequence, and an adder 

20 for adding the, incrementation value to the previous address. 

In another aspect of the invention there are a plurality of incrementation 
sequences. The multiplicands are stored in row order, wherein in a first row a first 
incrementation sequence is 0, in a second row a second incrementation sequence is 1 , 
in a third row first and second break points B1, B2 of a third incrementation sequence 

25-- are respectively determined by the relationships ~- 
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B1 Mn - 4 n B1 Mn - N £ 4" 

n=0 



N 



B2 Mw - £ 4" 



n*0 



and in a fourth row a third break point B3 of a third incrementation sequence is 
determined by the relationship 

35- B3 " N = 2*4" + 2 

wherein M N represents the memory of an Nth stage of the FFT processor. 



The receiver provides channel estimation and correction circuitry. jPilot location 
circuitry receives a transformed digital signal representing a frame from the FFT 
processor, and identifies the position of pilot carriers therein. The pilot carriers are 
spaced apart in a carrier spectrum of the transformed digital signal at intervals K and 
have predetermined magnitudes. The pilot location circuitry has a first circuit for 
computing an order of carriers in the transformed digital signal, positions of said carriers 
being calculated modulo K. There are K accumulators coupled to the second circuit for 
accumulating magnitudes of the carriers in the transformed digital signal, the 
accumulated magnitudes defining a set. A correlation circuit is provided for correlat.ng 
K sets of accumulated magnitude values with the predetermined magnitudes. In the 
correlation a first member having a position calculated modulo K in of each of the K sets 
is uniquely offset from a start position of the frame. 

According to another aspect of the invention the pilot location circuitry also has a 
bit reversal circuit for reversing the bit order of the transformed digital signal. 

: According to yet another aspect of the invention amplitudes are used to represent . 
the magnitudes of the carriers. Preferably the magnitudes of the carriers and the 
predetermined magnitudes are absolute values. 

In a further aspect of the invention the correlation circuitry also has a peak tracking 
circuit for determining the spacing between a first peak and a second peak of the K sets 
of accumulated magnitudes, wherein the first peak is the maximum magnitude, and the 
second peak is the second highest magnitude. 

The channel estimation and correction circuitry also has an interpolating filter for 
estimating the channel response between the pilot carriers, and a multiplication circuit 
for multiplying data carriers output by the FFT processor with a correction coeff.cient 
produced by the interpolating filter. 

-*The channel estimation and correction circuitry also has a phase extraction circuit 
accepting a data stream of phase-uncorrected I and Q data from the FFT processor, 
and producing a signal representative of the phase angle of the uncorrected data. The 
phase extraction circuit includes an accumulator for the phase angles of succeeding 
phase-uncorrected I and Q data. 

According to an aspect of the invention the channel estimation and correction 
circuitry includes an automatic frequency control circuit coupled to the phase extract.on 
circuit, in which a memory stores the accumulated common phase error of a first symbol 
carried in the phase-uncorrected I and Q data. An accumulator is coupled to the 
memory and accumulates differences between the common phase error of a plurality 
of pilot carriers in a second symbol and the common phase error of corresponding p.lot 
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carriers in the first symbol. The output of the accumulator is filtered, and coupled to the 
l/Q demodulator. 

According to another aspect of the invention the coupled output of the accumula- 
tor of the automatic frequency control circuit is enabled in the l/Q demodulator only 
during reception of a guard interval therein. 

According to yet another aspect of the invention the channel estimation and 
correction circuitry also has an automatic sampling rate control circuit coupled to the 
phase extraction circuit, in which a memory stores the individual accumulated phase 
errors<of pilot carriers in a first symbol carried in the phase-uncorrected I and Q data. 
An accumulator is coupled to the memory and accumulates differences. between the 
phase, errors of individual pilot carriers in a second symbol and phase errors of 
corresponding pilot carriers in the first symbol to define a plurality of accumulated 
intersymbol carrier phase error differentials. A phase slope is defined by a difference 
between a first accumulated intersymbol carrier phase differential and a second 
accumulated intersymbol carrier phase differential. The output of the accumulator is . 
filtered and coupled to the l/Q demodulator. 

According to one aspect of the invention the sampling rate control circuit stores a 
plurality of accumulated intersymbol carrier phase error differentials and computesa line 
of best fit therebetween. 

According to another aspect of the invention the coupled output signal of the 
accumulator of the automatic sampling rate control circuit is enabled in the resampling 
circuit only during reception of a guard interval therein. 

According to an aspect of the invention a common memory for storing output of 
the phase extraction circuit is coupled .to the.automatic frequency, control circuit and to 
the automatic sampling rate control circuit. , . . 

According to another aspect of. the invention the phase extraction circuit also has 
a pipelined circuit for iteratively computing the arctangent of an angle of rotation 
according to the series 

tan' 1 (x) = x-^+^-2^ + *-!-. ; |x|<1 
3 5 7 9 11 

wherein x is a ratio of the phase-uncorrected I and Q data. 

The pipelined circuit includes a constant coefficient multiplier, and a multiplexer 
for selecting one of a plurality of constant coefficients of the series. An output of the 
multiplexer is connected to an input of the constant coefficient multiplier. 

According to still another aspect of the invention the pipelined circuit has a 
multiplier, a first memory for storing the quantity x 2 , wherein the first memory is coupled 



to a first input of the multiplier, and has a second memory for holding any output of the 
multiplier. A feedback connection is provided between the second memory and a 
second input of the multiplier. The pipelined circuit also has a third memory for storing 
the value of the series. Under direction of a control circuit coupled to the third memory, 
the pipeline circuit computes N terms of the series, and also computes N+1 terms of the 
series. An averaging circuit is also coupled to the third memory and computes the 
average of N terms and N+1 terms of the series. 

Data transmitted in a pilot carrier of the multicarrier signal is BCH encoded 
according to a code generator polynomial h(x). A demodulator operative on the BCH 
encoded data is provided, which includes an iterative pipelined BCH.decoding circuit. 
The BCH decoding circuit is circuit coupled to the demodulator. It forms a Galois Field 
of the' polynomial, and calculates a plurality of syndromes therewith. The BCH decoding 
circuit includes a plurality of storage registers, each storing a respective one of the 
syndromes, and a plurality of feedback shift registers, each accepting data from a 
respective one of the storage registers. The BCH decoding circuit has a plurality of 
-Galois field multipliers. Each of the multipliers is connected in a feedback loop across 
a respective one of the feedback shift registers and multiplies the output of its 
associated feedback shift register by an alpha value of the Galois Field. An output 
Galois field multiplier multiplies the outputs of two of the feedback shift registers. 

A logical network forms an error detection circuit connected to the feedback shift 
registers and to the output Galois field multiplier. The output of the error detection circuit 
indicates an error in a current bit of data, and a feedback line is enabled by the error 
detection logic and connected to the storage registers. Using the feedback line, the data 
output by the feedback shift registers are written back into the storage registers for use 

in a-second iteration. 

^"According to an aspect of the invention the output Galois field multiplier has a first 
register initially storing a first multiplicand A, a constant coefficient multiplier connected 
to the first register for multiplication by a value a. An output of the constant coefficient 
multiplier is connected to the first register to define a first feedback loop, whereby in a 
kth cycle of clocked operation the first register contains a Galois field product Aa k . A 
second register is provided for storing a second multiplicand B. An AND gate is 
connected to the second register and to the output of the constant coefficient multiplier. 
An adder has a first input connected to an output of the AND gate. An accumulator is 
connected to a second input of the adder, and the Galois field product AB is output by 
the adder. 

The invention provides a method for the estimation of a frequency response of a 
channel. It is performed by receiving from a channel an analog multicarrier signal that 
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has a plurality of data carriers and scattered pilot carriers. The scattered pilot carriers 
are spaced apart at an interval N and are transmitted at a power that differs from the 
transmitted power of the data carriers. The analog multicarrier signal is converted to a 
digital representation thereof. A Fourier transform is performed on the digital representa- 
tion of the multicarrier signal to generate a transformed digital -signal. The bit order of 
the transformed digital signal is reversed to generate a bit-order reversed signal. 
Magnitudes of the carriers in the bit-order reversed signal are cyclically accumulated in 
N accumulators, amd the accumulated magnitudes are correlated with the power of the 
scattered pilot carriers. Responsive to the correlation, a synchronizing signal is 
generated that identifies a carrier position of-the. multicarrier signal, preferably an active 
carrier. " V - 

According to another aspect of the invention the step of accumulating magnitudes 
is performed by adding absolute values of a real component of the bit-order reversed 
signal to respective absolute values of imaginary components thereof to generate sums, 
and respectively storing the sums in the N accumulators. 

According to yet another aspect of the invention the step of correlating the 
accumulated magnitudes also is performed by identifying a first accumulator having the 
highest of the N values stored therein, which represents a first carrier position, and by 
identifying a second accumulator which has the second highest of the N values stored 
therein, which represents a second carrier position. The interval between the first carrier- 
position and the second carrier position is then determined. 

To validate the consistency of the carrier position identification/ the position of a 
carrier of a first symbol in the bit-order reversed signal is compared with a. position of a 
corresponding carrier of a second symbol therein. 

Preferably interpolation is performed between pilot carriers to determine correction; 
factors for respective intermediate data carriers disposed therebetween, and 
respectively adjusting magnitudes of the intermediate data carriers according to the 
correction factors. 

According to an aspect of the invention a mean phase difference is determined 
between corresponding pilot carriers of successive symbols of the transformed digital 
signal. A first control signal representing the mean phase difference, is provided to 
control the frequency of reception of the multicarrier signal. The first control signal is 
enabled only during reception of a guard interval. 

Preferably a line of best fit is determined for the inter-symbol phase differences of 
multiple carriers to define a phase slope. 
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BRIEF DESCRIPTION OF THE DRAWING 

For a better understanding of these and other objects of the present invention, 
reference is made to the detailed description of the invention, by way of example, which 
is to be read in conjunction with the following drawings, wherein: 

Fig. 1 illustrates the spectrum of a COFDM subchannel; 

Fig. 2 shows a frequency spectrum for multiple carriers in a COFDM signal; 

Fig. 3 is a diagram of a signal according to COFDM and shows a data symbol 
format; 

Fig. 4 is a block diagram illustrating an FFT based COFDM system; 
.Fig. 5 illustrates certain perturbations in a COFDM signal constellation; 
"pig. 6 is a flow diagram of a method of timing synchronization according to a 
preferred embodiment of the invention; 

Fig. 7 is a plot of an F ratio test performed on several data symbols for coarse 

timing synchronization; 

Fig. 8 is a plot of an incomplete beta function for different degrees of freedom; . 

Fig. 9 is a plot helpful in understanding a test of statistical significance according 
to the invention; 

Fig. 10 is an electrical schematic of a synchronization circuit according to an 
alternate embodiment of the invention; 

Fig. 1 1 is an electrical schematic of a synchronization circuit according to another 
alternate embodiment of the invention; 

Fig. 12 is a block diagram of a single-chip embodiment of a digital receiver in 

accordance with the invention; 

sr Fig. 13 is a block diagram illustrating the front end of the digital receiver shown in 

Fig. 12 in further detail; 

Fig. 14 is a block diagram illustrating the FFT circuitry, channel estimation and 
correction circuitry of the digital receiver shown in Fig. 1 2; 

Fig. 1 5 is a block diagram illustrating another portion of the digital receiver shown 

in Fig. 12; " 

Fig. 16 is a more detailed block diagram of the channel estimation and correction 

circuitry shown in-Fig. 1 4j_ 

Fig. 17 is a schematic of the automatic gain control circuitry of the digital receiver 

shown in Fig. 12; 

Fig. 18 is a schematic of the l/Q demodulator of the digital receiver shown in Fig. 

12; 

Fig. 19 illustrates in greater detail a low pass filter shown in Fig. 13; 
Fig. 20 shows the response of the low pass filter shown in Fig. 19; 
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Fig. 21 shows the resampling circuitry of the digital receiver shown in Fig. 12; 
Fig. 22 illustrates a portion of an interpolator in the resampling circuitry of Fig. 21 ; 
Fig. 23 is a more detailed block diagram of the FFT window circuitry shown in Fig. 

14; 

Fig. 24 is a schematic of a butterfly unit in the FFT calculation circuitry shown in 
Fig. 14; 

Figs. 25 and 26 are schematics of butterfly units in accordance with the prior art; 
Fig. 27 is a schematic of a radix 2 2 + 2 FFT processor in accordance with the 
invention; 

Fig. 28 is 32 point flow graph of the FFT processor shown in Fig, 27; 

Fig. 29 is a schematic of a configurable 2K/8K radix 2 2 +2 single path, delay 
feedback pipelined FFT processor in accordance with the invention; 

Fig. 30 is a detailed schematic of a complex multiplier used in the circuitry shown 
in Fig. 29; 

Fig. 31 is a detailed schematic of an alternate embodiment of a complex multipliers 
used in the circuitry shown in Fig, 29; 

Fig. 32 is another diagram illustrating the organization of the twiddle factors for 
each o'f the multipliers in the circuitry shown in Fig. 29; 

Fig. 33 illustrates the organization of the twiddle factors for each of the multipliers 
"in the circuitry shown in Fig. 29; 

Fig. 34 is a schematic of address generator used in the circuitry shown in Fig. 29; 

Fig. 35 is a schematic of a generalization of the address generator shown in Fig. 
34; . , . 

Fig. 36 is a flow chart illustrating the process of pilot location conducted by the 
channel estimation and correction circuitry shown in Fig. 16; 1 ' 

Fig. 37 is a flow chart of an embodiment of the pilot localization procedure 
according to the invention. 

Fig. 38 is a more detailed block diagram of the tps sequence block of the circuitry 
shown in Fig/ 14; 

Fig. 39 is a schematic of a BCH decoder used in the tps processing circuitry shown 
.in Fig. 38; . 

- Fig. 40 js a- more detailed schematic of a Galois field multiplier shown in Fig. 39; 

Fig. 41 is a block diagram generically illustrating the automatic sampling control 
and automatic frequency control loops of the digital receiver shown in Fig. 12; 

Fig. 42 is a more detailed block diagram of the automatic sampling control and 
automatic frequency control loops shown in Fig. 41 ; 
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Fig. 43 is a more detailed block diagram of the phase extract block of the circuitry 
shown in Fig. 42; 

Fig. 44 is a schematic of the circuitry employed to calculate an arctangent in the 
block diagram shown in Fig. 43; 

Fig, 45 is a plot of the square error at different values of a of the Taylor expansion 

to 32 terms; 

- Fig. 46 is a plot of the square error at different values of a of the Taylor expansion 
to 31 terms; 

Fig. 47 is a plot of the square error at different values of a of the average of the 
Taylor expansion to 31 and 32 terms; 

Fig. 48 is a plot of the phase differences of pilot carriers with a line of best fit 

shown; 

Fig. 49 is a more detailed block diagram an alternate embodiment of the automatic 

sampling control and automatic frequency control loops shown in Fig. 41 ; 

Fig. 50 illustrates a coded constellation format used in the demapping circuitry of 

Fig. 15; 

1 Fig. 51 illustrates the conversion of l,Q data to binary data value using the format 
shown in Fig. 50; 

Fig. 52 is a more detailed block diagram of the symbol deinterleaving circuitry 
shown in Fig. 15; 

Fig. 53 is a more detailed block diagram of the bit deinterleaving circuitry shown 
in Fig. 15; 

" Fig. 54 illustrates the conversion from a coded constellation format to a 24 bit soft 
l/Q format by the bit deinterleaving circuitry shown in Fig. 53; 

Fig. 55 is a more detailed block diagram of the microprocessor interface of the 

receiver shown in Fig. 12; 

Fig. 56 is a more detailed block diagram of the system controller of the receiver 

shown in Fig. 12; and 

Fig. 57 is a state diagram relating to channel acquisition in the system controller 

of the receiver shown in Fig. 56. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
Alignment of The FFT Window 

Referring again to Figs. 3 and 4, according to the invention a statistical method is 
applied to COFDM signals to find the end of the guard interval 10. This method is 
explained with reference to the above noted European Telecommunications Standard, 
but is applicable to many forms of frequency division multiplexing having prefixed or 
postfixed guard intervals. It allows the receiver 32 to find the end of the guard interval 
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given only the received sampled complex signal ( solid line 12) and the size of the active 
interval 6 . The method relies on the fact that the guard interval 10 is a copy of the last 
part of the data symbol 4. In the receiver 32, due to echoes and noise from the channel 
and errors in the local oscillator, the guard interval 10 and the last part of the data 
5 symbol 4 will differ. If the errors introduced are random then a statistical method can be 
applied. According to the invention, the received complex signal is sampled at a rate 
which is nearly identical to that used in the transmitter. A difference signal is found for 
a pair of received samples which are separated by a period of time which is as close as 
• possible to the active interval 6, This period should be equal to the size of the fast 
10 fourier transform ("FFT") being applied (i.e. 2048 or 8192 samples). Let. 

S i - 'Si'-^-fftsize 1 (14) 

where S t is -the difference signal; $ and % Wsi2 e are the current and previous complex 
input samples of which the modulus is taken. That is ? the subscript T indexesa linear 

1 5 time sequence of. input values. Assuming that the input signal is random, then Sj is also, 
random. Within the guard interval s ( and Sj_ msi2e will be similar, although not identical, due 
to the effects of the channel. Sj will be therefore a random signal with a small dispersion. 
As used herein the term "dispersion" means generally the spread of values, and is not 
4 restricted to a particular mathematical definition. In general the active part of one symbol 

20 is not related to the active part of the next symbol. Outside of the guard interval Sj will 
be random with a much larger dispersion/In order to find the end of the guardjnterval, 
the dispersion of the difference signal Sj is monitored to look for a significant increase 
which will occur at the boundary of the guard interval 10 and the active interval 6. The 
inventors have also observed that a large ^decrease in dispersion is. seen at the start of 

25 the guard interval 10. 

According to a preferred embodiment of the invention samples of the input signal 
are stored over an interval which includes at least one symbol period T s . The dispersion 
of the difference signal S ( - is calculated over a block of samples. The block is moved 
back in time over a number of samples, n F and the dispersion is recalculated. These two 

30 blocks are referred to herein as "comparison blocks". The ratio of a current dispersion 
in a first comparison block to the dispersion in a previous comparison block is found. 
Then, the F ratio significance test is used to find significant differences in the 
dispersions of the two comparison blocks. The F ratio is defined as 
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where n is a positive integer, i indexes the input samples, and VAR(i) is the variance of 
a block of values of length N samples. Variance can be defined as 



(16) 



While the F ratio significance test is used in the preferred embodiment, other functions 
of the two dispersion values which give a signal relating to the change in dispers.on 
could be used. There are many such functions. An advantage of the F ratio is that for 
a random input signal it has a known probability distribution, allowing conven.ent 
statistical analysis for purposes of performance analysis and system design. Also the 
F ratio intrinsically normalizes the signal, making the result independent of the s.gnal 
level: 

The method is disclosed with reference, to Fig. 6, in which a first member of a 
sample pair in a current evaluation block is measured at step 38. A delay of one active 
interval 6 ( Fig. 3) is experienced in step 40. This may be accomplished wrth a d.g.tal 
delay such as a FIFO, or equivalent^ by buffering samples for an active interval .n a 
memory and accessing appropriate cells of the memory. A second member of the 
sample pair is measured in step 42, and the difference between the first and second 
member is determined and stored in step 44. The end of the current block is tested at 
decision step 46. The size of the evaluation block should not exceed the length of a 
guard interval, and may be considerably smaller. In the event the end of the current 
block has not yet been reached, another sample is acquired at step 48, and control 

returns to step 38. ' 

If the end of the current block has been reached, the dispersion of the current 
block is measured in step 50, and is treated as one of two comparison blocks of data. 
A test is made at decision step 52 to determine if a group of two companson blocks 
have been evaluated. If this test is negative, then another block of data is acqu.red .n 
step 54, after which control returns to step 38. The other block of data need not be 
contiguous with the block just completed. 

In the event the test at decision step 52 is positive, the F ratio is computed for the 
group of two comparison blocks at_step 56. The results obtained in step 56 are 
submitted to peak detection in step 60. Peak detection optionally includes stat.st.cal 
tests of significance, as is explained hereinbelow. 

If peaks are detected, then the boundary of a guard interval is established .n step 
62 for purposes of synchronization of the FFT window which is necessary for further 
signal reconstruction. If peaks are not detected, the above process is repeated with a 
block of samples taken from another portion of the data stream. 
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Example 1: 

Referring now to Fig; 7 a complex signal was generated according to the above 
noted European Telecommunications standardising a random number generator, and 
transmitted across a Ricean channel model together with added white Gaussian noise 
5 (SNR = 3.7); Data symbols were then analyzed according to the above described 
•method. The results 6 data symbols are shown in Fig. 7 f wherein the F ratio is plotted 
for convenience of presentation on a logarithmic axis as line 64, because the spikes 66, 
68, at the beginning and end of the guard intervals respectively, are very large. 

' Although it is quite evident from Figure 7 that the ends of the guard intervals are 
10 easy to find using any of several well known peak detectors, it is possible to apply a 
statistical test to more accurately answer the question: do the two blocks of samples 
have the same dispersion? This is the null hypothesis, H 0 , i.e. the dispersion is the 
same and the observed spike in F is due to random fluctuations only. If H 0 has very low 
probability it can be rejected, which would correspond to detection of the start or end of 
1 5 the guard interval. From the way the COFDM symbol is constructed H 0 is expected to 
* be trtie for comparison blocks lying entirely within the guard interval or within the active 
interval, but false when the comparison blocks straddle a boundary at the start or end 
1 of the guard interval. If comparison blocks of random samples are drawn from the same 
population then the probability of F is given by 
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Q(F\v v v 2 ) = 1^,-1) (17) 



where l() is the incomplete Beta function, 

25 x = (1 8) 

v 2 + Vl F 

and v 1 and v 2 are the number of degrees of freedom with which the first and second 
dispersions are estimated. In this example v1 = v2 = (N-1) if n >= N. The shape of the 
function is shown in Fig. 8. From a statistical point of view n should be sufficiently large 

30 so that the two blocks do not overlap, i.e. n N. If the blocks do overlap, then the 
calculation of the second dispersion will use samples used for the calculation of the first 
dispersion/This effectively reduces the number of degrees of freedom and hence the 
significance of the result. It has been determined that setting n=N works well. 

The function Q() in equation (13) actually gives the one-tailed probability. H 0 could 

35 be rejected if F is either very large or very small, and so the two-tailed test is required. 
Actually the two tails are identical, so for a two-tailed test the probability is double that 
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given in equation (13). However, this results in values of probability greater than one for 
F<1 . The probability, p, is therefore calculated as follows: 

P=2/x("^.-j) d9) 

and then, if (p > 1), P = 2 - p. This probability reflects the viability of H 0 . Thus if p is 
small, H 0 can be rejected and it can be stated, with a specified degree of certainty, that 
the comparison blocks come from sample populations with different dispersion. The 
noted European Telecommunications Standard specification states that the block size, 
N, should be 32 for a correlation algorithm. N={32,64} have been successfully tried. The 
probability functions obtained are shown in Fig. 9 using these values for N: In the 
preferred embodiment p <= 0.05 has been set for the rejection of H 0 . 

A precise implementation, would be to calculate F, then x, then the incomplete Beta 
function, then p and then apply the threshold test. This algorithm would be very difficult 
to realize in hardware since the Beta function is very complicated. In the preferred 
embodiment it is much simpler, and gives the same results, to set the acceptance 
threshold and N parameter, and thus define an upper and lower limit for F. It is then only 
necessary to calculate F and compare it with the limits, In order to simply find the end 
of the guard interval it may be safely assumed that F>1, Only the upper limit on F is 
needed. To calculate the limits on F accurately, a suitable root-finding method, such as 
Newton-Raphson may be utilized. Typical values are given in Table 1. 



p threshold 


v1 = v2 = 31 


v1 = v2 = 63 


F lower F_upper 


FJower 


F_upper 


0.2 


0.627419 


1 .593832 


0.722591 


1 .383909 


0.1 


0.548808 


1.822132 


0.658620 


1.518326 


0.05 


0.488143 


2.048582 


0.607525 


1 .646022 


0.01 


0.386894 


2.584689 


0.518205 


1 .929738 


0.005 


0.354055 


2.824422 


0.487936 


2.049448 


0.001 


0.293234 


3.410251 


0.429794 


2.326695 
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This method has been successfully tested using the specified channel model with 
additive white Gaussian noise (SNR=3.7). 

The formula for dispersion given in Equation (12) would require a multiplier for 
implementation in silicon. The calculation of F is a division in which the (N-1) 
normalisation constants cancel out as long as the two blocks have the same size. 
Accurate multiplication and division can be expensive in silicon. In the preferred 
embodiment simplifications have been implemented which give less accurate, but still 
Viable, values for F. S t can be assumed to have zero mean so it is not necessary to 
calculate the mean from the block- of-samples. This also increases the number of 
degrees of freedom from (N-1) to N.rlnstead of calculating variance: using the standard 
sum of squares formula, the dispersion can be estimated by the mean absolute 
deviation. The formula for VAR(i) becomes 
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VAR(i) = — 
N 



N-1 

E 



(20) 



20' 



The (1/N) factor divides out in the calculation of F if the two blocks have the same size, 
But there still remains the division of the two dispersions and the squaring required. 
These can be tackled using logarithms to the base 2. Substituting from Equation (16); 
into Equation (11) gives 
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F = 



/V-1 

E is;.,i 

/=o 

/V-1 

E ISUyl 



(21=) 
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Taking logs to the base 2 gives 



logF '= 2(logs.,-logs b ) = y -(22) 

It is then only necessary to calculate y and compare it with the logarithm to the base 2 
of the F upper limit. The comparison can be made by subtracting the log of the limit from 
2(log2sa-log2sb) and comparing with zero. The factor of 2 can be absorbed into the 
limit. 
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Calculation of the logs to base two is relatively straightforward in hardware if the 
numbers are stored as fixed point fractions. The fractions can be split into an exponent 
and a fractional mantissa: x = A2 B . Taking log base 2 gives logx = logA + B. Since A is 
fractional it is practical to find its logarithm using a lookup table. The exponent B can be 
found from the position of the MSB (since s a and s b will both be positive numbers). 

The calculation can thus be reduced to require only addition and subtraction 
arithmetic operations. The limit should also be recalculated using v1=v2=N if using this 
method. In practice, the significance level may be set empirically for a particular 
application, preferably p = 0.05. 

sit will be appreciated by those skilled in the art that various measures of dispersion 
may*be utilized without departing from the spirit of the invention, for example the 
standard deviation, skew, various moments, histograms, and other calculations known 
in the art. 

In a first alternate embodiment of the invention, the above described method is 
employed using either the real or the imaginary parts of the signal instead of the 
modulus. This embodiment achieves economy in hardware. 

In a second alternate embodiment of the invention, the n parameter of equation 
(1 1 ) has been optimized. At the end of the guard interval, the two blocks straddle more 
of the transition to the active interval, giving a well-defined increase in the dispersion. 
Using any value n>2 has the drawback that several successive points will give 
significant increases as the later block travels up to the boundary. This small problem 
is easily overcome by introducing a dead period after detection of the boundary. That 
is, once a spike has been detected a set of samples equal to the size of the FFT window 
is accepted before further attempts are made to locate another spike. The dead period 
has-the added benefit of not introducing false spikes. When using larger values of n the 
spikes 66, 68 ( Fig. 7) increase, whilst the H 0 noisy F signal remain much the same. 
Example 2: 

The maximum F-spike height as a function of n has been measured systematically 
together with the background variation in F. The results are shown in Table 2. 
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1 .0014 
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1.0014 


0.19 
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1.0012 


0.23 


20.9 
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40 


0.9975 


0.24 


22.0 


92 


"50 


0.9926 


0.25 


20.4 
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■*■ Table 2 was developed using the first 5 frames of the signal analyzed in Fig. 7. 
The statistics in columns (2) and (3): of Table 2 were made by excluding any points 
where F>=3.0 to exclude spikes from the calculations. The spikes would otherwise affect 
the values of mean and standard deviation even though they are from a different 
statistical population. 

The results indicate that the background variation in F, F sd . , was affected by n, 
increasing asymptotically to a value of approximately 0.28. It is likely that this is the 
effect of oviertapping blocks. For example, for N=64 and n<64, the blocks over which the 
dispersions are calculated will contain some of the same values and therefore be 
correlated. To test this theory Fs.d. was evaluated for n>N, and the results are shown 
in Table 3. 



Table 3 



n 


F_ 


" ' 60 


0.258 


70 


0:266 


80 


0.270 


90 


0.278 


100 


0.285 


- 128 


0.297 


256 


0.366 



The dependence becomes linear at n N/2. If F is calculated every n samples, 
rather than every sample, then this dependence may be reduced. However, this creates 
a' risk for small guard intervals of not having the first block wholly within the guard 
interval and the second wholly within the active interval. 

A third alternate embodiment of the invention is disclosed with reference to Fig. 1 0, 
which schematically illustrates a timing synchronization circuit 70. The circuit accepts 
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a complex input signal 72, and includes a circuit module 74 which develops the modulus 
of its input, which is taken from node 83. The circuit module 74 insures that the value 
being subsequently processed is an unsigned number. The input to the circuit module 
74 is a difference signal which is developed by a subtracter 75 which takes as inputs the 
input signal 72 and a delayed version of the input signal 72 which has been processed 
through a delay circuit 79, preferably realized as a FIFO 77 of length L, where L is the 
size of the FFT window. As explained above, it is also possible to operate this circuit 
where the input signal 72 is real, imaginary, or complex, or even the modulus of a 
complex number. In the case where the input signal 72 is real, or imaginary, the circuit 
module 74 can be modified, and can be any known circuit that removes the sign of the 
output of the subtracter 75, or equivalently sets the sign so that the outputs accumulate 
monotonically; i.e. the circuit has a unipolar output. The output of the circuit module 74 
is ultimately clocked into a digital delay, which is preferably implemented as a FIFO 78. 
When the FIFO 78 is full, a signal SIG1 80 is asserted, and the output of the FIFO 78 
becomes available, as indicated by the AND gate 82. An adder/subtracter circuit 84 is 
also connected to the node 76, and its output is stored in a register 86. A delayed 
version of the output of the adder/subtracter circuit 84 is taken from the register 86 and 
fed back as a second input to the adder/subtracter circuit 84 on line 88. In the event the 
signal SIG1 80 has been asserted, a version of the output of the circuit module 74, 
delayed by a first predetermined interval N, where N is the number of samples in the 
comparison blocks, is subtracted from the signal on node 76. 

The signal on line 88 is an index into a lookup table, preferably implemented as 
a read-only-memory ("ROM"), and shown as ROM 90,The address of the ROM 90 
contains the logarithm to the base 2 of the magnitude of the signal on line 88, which 
their appears at node 92. The node 92 is connected to a subtracter 94, and to a delay 
circuit, shown as FIFO 98, which is used to develop the denominator of the middle term 
of equation (17). 

The subtracter 94 produces a signal which is compared against the log 2 of a 
predetermined threshold value F UMIT in a comparison circuit 1 06, shown for simplicity 
as an adder 108 connected to a comparator 110. The output signal SYNC 112 is 
asserted when the boundary of a guard interval has been located. 

Although not implemented in the presently preferred embodiment, It is also 
possible to configure the size of the FIFO 77 dynamically, so that the size of the interval 
being evaluated can be adjusted according to operating conditions. This may 
conveniently be done by storing the values on the node 92 in a RAM 1 14 for computa- 
tion of their dispersion. 
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In a fourth alternate embodiment of the invention, explained with reference to Fig. 
1 1 , components similar to those of the embodiment shown in Fig. 10 have the same 
reference numerals. A timing synchronization circuit 116 is similar to the timing- 
synchronization circuit 70, except now the delay circuit 79 is realized as the FIFO 77, 
and another FIFO 1 00, one of which is selected by a multiplexer 102. Both of the FIFOs 
77, 100 provide the same delay; however the capacities of the two are different. The 
FIFO 100 provides for storage of samples taken in an interval equal to the size of the 
FFT window, and is normally selected in a first mode of operation, for example during 
channel acquisition, when it is necessary to evaluate an entire symbol in order to locate 
a boundary of a guard interval. In the noted European Telecommunications standard, 
up to 8K of data storage is required, with commensurate resource requirements. During 
subsequentoperation, the approximate location of the guard interval boundaries will be 
known from the history of the previous symbols. In a second mode of operation, It is 
therefore only necessary to evaluate a much smaller interval in order to verify' the exact . 
location of the guard interval boundary. The number of samples used in the computations 
of the dispersion can be kept to a small number, preferably 32 or 64, and the much 
smaller FIFO 77 accordingly selected to hold the computed values. The resources^ 
saved thereby can be utilized for other functions in the demodulator, and memory 
utilized by the larger FIFO 100 may also be reallocated for other purposes. 

A control block 81 optionally advances the evaluation interval relative to symbol 
boundaries in the data stream in successive symbols, and can also be used to delay for 
the dead period. Eventually the moving evaluation interval straddles the boundary of the 
current symbol's guard interval, and synchronization is then determined. The size of the 
evaluation interval is chosen to minimize the use of memory, yet to be Jarge enough to 
achieve statistical significance in the evaluation interval. The size of the evaluation 
interval, and the FIFO 77 may be statically or dynamically configured. 
Single Chip Implementation of a COFDM Demodulator 
Overview 

Referring initially to Fig. 12, there is shown a high level block diagram of a 
multicarrier digital receiver 126 in accordance with the invention. The embodiment- 
described hereinbelow conforms to the ETS 300 744 telecommunications standard (2K 
mode), but can be adapted by those skilled in the art to operate with other standards 
without departing from the spirit of the invention. A>radio frequency signal is received 
from a channel such as an antenna 128, into a tuner 130, which is conventional, and 
preferably has first and second intermediate frequency amplifiers. The output of the 
second intermediate frequency amplifier (not shown), is conducted on line 132 to an 
analog to digital converter 134. The digitized output of the analog to digital converter 1 34 
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is provided to block 136 in which l/Q demodulation, FFT, channel estimation and 
correction, inner and outer deinterleaving, and forward error correction are conducted. 
Carrier and timing recovery are performed in block 1 36 entirely in the digital domain, and 
the only feedback to the tuner 130 is the automatic gain control ("AGC") signal which 
is provided on line 138. A steady 20 MHz clock on line 140 is provided for use as a 
sampling clock for the external analog to digital converter 134. A host microprocessor 
interface 142 can be either parallel or serial. The system has been arranged to operate 
with a minimum of host processor support. In particular channel acquisition can be 
achieved without any host processor intervention. 

"The functions performed within the block 136 are grouped for convenience of 
presentation into a front end (Fig. 13), FFT and channel correction group (Fig. 14), and 

a back end (Fig. 15). 

As shown in Fig. 13, l/Q samples at are received by an IQ demodulator 1 44 from 
the analog to digital converter 134 (Fig. 12) on a bus 146 at a rate of 20 megasamples 
per second. An AGC circuit 148 also takes its input from the bus 146. A frequency rate 
control loop is implemented using a numerically controlled oscillator 150, which receives 
frequency error signals on line 152, and frequency error update information on line 154. 
Frequency and sampling rate control are achieved in the frequency domain, based on 
the pilot earner information. The frequency error signals, which are derived from the pilot 
carriers, and the frequency error update information will both be disclosed in further 
detail shortly. The I and Q data output from the IQ demodulator 144 are both passed 
through identical low pass filters 156, decimated to 10 megasamples per second, and 
provided to a sine interpolator 158. Sample rate control is achieved using a numerically 
controlled oscillator 160 which receives sample rate control information derived from the 
pilot-signals on line 162. and receives sample error update timing information on line. 
1 64: ' 

As shown in Fig. 14, acquisition and control of the FFT window are performed in 
block 166, which receives signals from the sine interpolator 158 (Fig. 13). The FFT 
computations are performed in FFT calculation circuitry 168. Channel estimation and 
correction are performed in channel estimation and correction block 170, and involves 
localization of the pilot carriers, as will be described below in greater detail. The tps 
information obtained during pilot localization is processed in tps sequence extract block 
172. Uncorrected pilot carriers are provided by the circuitry of channel estimation and 
correction block 170 to correction circuitry 174, which develops sampling rate error and 
frequency error signals that are fed back to the numerically controlled oscillators 150, 
160 (Fig. 13). 
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Referring to Fig. 15, corrected I and Q data output from channel estimation and 
correction block 170 are provided to demapping circuitry 176. The current constellation 
and hierarchical constellation parameters, derived from the tps data, are also input on 
lines 178, 180. The resulting symbols are deinterleaved in symbol deinterleaver 182, 
utilizing a 1512 x 13 memory store. One bit of each cell in the memory store is used to 
flag carriers having insufficient signal strength for reliable channel correction. Bit 
deinterleaver 184 then provides deinterleaved I and Q data to a Viterbi Decoder 186, 
which discards the flagged carriers, so that unreliable carriers do not influence traceback 
metrics. A Forney deinterleaver 188 accepts the output of the Viterbi Decoder 186 and 
is coupled to a Reed-Solomon decoder 190. The forward error correction, provided by 
the Viterbi and Reed-Solomon decoders is relied upon to recover lost data in the case 
of flagged carriers. 

Referring to Fig. 16, in the presently preferred embodiment a mean value is 
calculated in block 192 for uncorrected carriers with reference to the previous symbol.- 
Data carriers whpse interpolated channel response falls below some fraction, preferably^ 
0.2, of this mean will be marked with a bad_carrier flag 194. The bad_carrier f lag % 1 94. 
is carnednth rough the demapping circuitry 176, symbol deinterleaver 182, and bit 
deinterleaver 184, to the Viterbi Decoder 186 where it is used to discard data relating 
to the unreliable carriers. The parameters used to set the bad_carrier flag 194 can be 
varied by the microprocessor interface 142. 

An output interface 196 produces an output which can be an MPEG-2 transport 
stream. The symbol deinterleaver 182, and the bit deinterleaver 184 are conventional. 
The Viterbi decoder 186, Forney deinterleaver 188, Reed-Solomon decoder .190,' and 
the output interface 196 are conventional. They can.be the components disclosed in 
copending Application No. 638,273, entitled u An Error Detection and Correction System, 
for a Stream of Encoded Data", filed April 26, 1996, Application No. 480,976, entitled 
■Signal Processing System", filed June 7, 1995, and Application No. 481,107, entitled 
"Signal Processing Apparatus and Method", filed June 7, 1995, all of which are 
commonly assigned herewith, and are incorporated herein by reference. The operation 
of the multicarrier digital receiver 126 (Fig. 12) is controlled by a system controller 198. 

Optionally the hierarchical constellation parameters can be programmed to speed 
up channel acquisition, rather than derived from the tps data. 

The input and output signals and the register map of the multicarrier digital 
receiver 126 are described in tables 4, and 5 respectively. 
Automatic Gain Control 

The purpose of the AGC circuit 148 (Fig. 1 3)is to generate a control signal to vary 
the gain of the COFDM input signal to the device before it is analog-to-digital converted. 
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As shown in greater detail in Fig. 17, a Sigma-Delta modulator 200 is used to provide 
a signal which can be used as a gain control to a tuner, once it has been low-pass 
filtered by an external R-C network. 

The magnitude of the control voltage signal 202 is given by: 

5 control_voltage = control_voitage - error (23) 

where 

error = K(ldatal - mean) (24) 

where K is a constant (normally K«1) which determines the gain in the AGC control 
10 loop". The mean value can be determined from the statistics of Gaussian noise, which 

is a close approximation to the properties of the COFDM input signal, where the input 

data is scaled to +/-1 . The control voltage signal 202 is set back to its initial value when 

the signal resync 204 is set low, indicating a channel change or some other event 

requiring resynchronization. 
15 ; The input and output signals and the registers for the microprocessor interface 1 42 

of the AGC circuit 148 are described in tables 6, 7, and 8 respectively. 

IQ Demodulator 

The function of the IQ demodulator 144 (Fig. 13) is to recover in-phase and 
quadrature components of the received sampled data. It is shown in further detail in Fig. 
20 18. 

The numerically controlled oscillator 150 generates in-phase and' quadrature 
sinusoids at a rate of (32/7) MHz, which are multiplied with data samples in multipliers 
206. The address generator 208 advances the phase linearly. The frequency error input 
21 ©'increments or decrements the phase advance value. The samples are multiplied 

25 with 'the sinusoids in the multipliers 206using 1 0 bit x 1 0 bit multiply operations. In one 
embodiment the IQ demodulator 144 is operated at 20 MHZ and then retimed to 40MHz 
in retiming block 21 2. In a preferred embodiment the IQ demodulator 1 44 is operated 
at 40MHz, in which case the retiming block 212 is omitted. 

Sinusoids are generated by the address generator 208 on lines 214, 216. The 

30 phase value is employed as an address into a lookup table ROM 218. Only quarter 
cycles are stored in the lookup table ROM 218 to save area. Full cycles can be 
generated from the stored quarter cycles by manipulating the data from the ROM 218 
and inverting the data in the case of negative cycles. Two values are read from the 
lookup table ROM 218 for every input sample ~*a cosine and a sine, which differ in 

35 phase by 90 degrees. 

The input and output signals of the IQ demodulator 144 are described in tables 9 

and 10 respectively. 




26 

Low Pass Filter 

The purpose of the low pass filters 156 (Fig. 13) is to remove aliased frequencies 
after IQ demodulation - frequencies above the 32/7 MHz second IF are suppressed by 
40dB. I and Q data are filtered separately. The output data is decimated to 10 
5 megasamples per second ("Msps") because the filter removes any frequencies above 
1/4 of the original 20 Msps sampling rate. The filter is constructed with approximately 
60 taps which are symmetrical about the center, allowing the filter structure to be 
optimized to reduce the number of multipliers 220. Fig. 19 is a block diagram of one of 
the low pass, filters 156, the other being identical. Fig. 19 shows a representative 
10 symmetrical tap 222, and a center tap 224. The required filter response of the low pass 
filters 156 is shown in Fig. 20. 

The input and output signals of the low pass filters 156 are described in tables 1 1 
and 1 2 respectively. . . 

Resampling 

15 Referring to Fig. 13, the purpose of resampling is to reduce the 10 Msps data 

stream output from the low pass filters 156 down to a rate of (64/7) Msps, which is the 
nominal sample rate of the terrestrial digital video broadcasting ("DVB-T") modulator at 
the transmitter. 

■ Resampling is accomplished in the sine interpolator 158, and the numerically 

20 controlled oscillator 160. The latter generates a nominal 64/7 MHZ signal. The 
resampling circuitry is shown in further detail in Fig. 21. The numerically controlled 
oscillator 1 60 generates a valid pulse on line 226 and a signal 228 representing the 
interpolation distance for each 40MHz clock cycle in which a 64/7MHz sample' should 
be produced. The interpolation distance is used to select the appropriate set of 

25 ■ ^interpolating filter coefficients which:are stored. in coefficient ROMs 230. It should be 
noted that only the sine interpolator for I data is illustrated in Fig. 21 . The structures for 
Q data are identical. 

Fig. 22 illustrates the generation of the interpolation distance and the valid pulse. 
Nominally T s = 1/10 Msps, and T = 1/ (64/7) Msps. The sine interpolation circuit 

30 disclosed in our noted Application No. 08/638,273 is suitable, with appropriate 
adjustment of the operating frequencies. 

The input and output signals of the sine interpolator 158 and the numerically 
controlled oscillator 160 are described in tables 13 and 14 respectively. 
FFT Window . «* 

35 As has been explained in detail above, the function of the FFT Window function 

is to locate the "active interval" of the COFDM symbol, as distinct from the "guard 
interval". This function is referred to herein for convenience as "FFT Window". In this 
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embodiment the active interval contains the time domain representation of the 2048 
carriers which will be recovered by the FFT itself. 

The FFT window operates in two modes; Acquisition and Tracking. In Acquisition 
mode the entire incoming sample stream is searched for the guard interval/active 
interval boundary. This is indicated when the F-ratio reaches a peak, as discussed 
above. Once this boundary has been located, window timing is triggered and the 
incoming sample stream is searched again for the next.guard interval/active interval 
boundary. When this has been located the length of the guard interval is known and the 
expected position of the next guard/active boundary can be predicted. The FFT window 
function then switches to tracking mode. 

^This embodiment is similar to the fourth alternate embodiment discussed above 
in respect of the tracking mode. In tracking mode only a small section of the incoming 
sample stream around the point where the guard/active boundary is expected to be is 
searched. The position of the active interval drifts slightly in response to IF frequency 
and sampling rate offsets in the front-end before the FFT is calculated. This drift is 
tracked and FFT window timing corrected, the corrections being inserted only during the 
guard interval. 

It will be appreciated by those skilled in the art that in a practical single chip 
implementation as is disclosed herein, memory is an expensive resource in terms of 
chip area, and therefore must be minimized. Referring to Fig. 23, during Acquisition 
mode the FFT calculation process is not active so hardware can be shared between the 
FFT Window and the FFT calculation, most notably a 1024x22 RAM 232 used as a 
FIFO by the FFT Window, and selected for receipt of FFT data on line 234 by a 
multiplexer 236. Once in Tracking mode the FFT calculation process is active so that 
other control loops to recover sampling rate and frequency which depend on FFT data 
(e.g. pilots in the COFDM symbol) can initialize. Therefore tracking mode requires a 
dedicated tracking FIFO 238, which is selected by a multiplexer 240. 

The input and output signals, and signals relating to the microprocessor interface 
142 of the FFT Window circuitry shown in Fig. 23 are described in tables 15, 16, and 17 
respectively. 

In one embodiment a threshold level, set from statistical considerations^ applied 
to the F-ratio signal (see Fig. 7) to detect the negative and positive spikes which occur 
at the start and end of the guard interval respectively. The distance between the spikes 
is used to estimate the guard inteival size. Repeated detection of the positive spikes is 
used to confirm correct synchronization. However with this method under noisy 
conditions the F-ratio signal becomes noisy and the spikes are not always reliably 
detectable. 
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In another embodiment peak detection is used to find the spikes in the F-ratios. 
It has been found that a fixed threshold is reliable only at or exceeding about a carrier- 
to-noise ("C/N") ratio of 12 dB. Peak detection is generally more sensitive and more 
specific, with generally reliable operation generally at 6 - 7 dB. The maxima should 
occur at the end of the guard interval. The difference in time between the two maxima 
is checked against the possible guard interval sizes. With an allowance for noise, the 
difference in time indicates the most likely guard interval size and the maxima 
themselves provide a good indication of the start of the active part of the symbol. 

Preferably this process is iterated for several symbols to confirm detection, and is 
expected to improve performance when- the C/N ratio is low. 

The data stream is passed to accumulators 242, 244, each holding 64 moduli. 
Conversion to logarithms and subtraction of the logarithms is performed in block 246. 
The peaks are detected in peak detector block 248. Averaging of the symbol peaks is 
performed in block 250. 

In noisy, conditions, the maxima may be due to noise giving possibly inaccurate/ 
indications of the guard interval length and the start of the active symbol. The general'* 
strategy to cope with this is to perform a limited number of retries. ; 

Currently, calculation of the F-ratio is done "on the fly" i.e. only once at each point. 
The variance estimates are calculated from 64 values only. Under noisy conditions, the 
variance- estimates become very noisy and the spikes can become obscured. In an 
optional variation this problem is solved by obtaining more values for the variance 
estimate, by storing the variance estimate during acquisition for each of the possible 
T+ Gniax points in the storage block 256. The variance estimates themselves may be 
formed by accumulating variances for each point, and then filtering in time over a 
number of symbols. A moving average filter or an infinite impulse response ("IIR") filter 
is suitable. A moving run of symbols, preferably between 16 and 32, are integrated in 
block 252, which increases the reliability of peak detection under noisy conditions. The 
storage block 256 holding the integrated F-ratio values is searched to find the maximum 
value. This is of length T+G max , where G max is the maximum guard interval size, T/4. 
Preferably the memory for storage block 256 is dynamically allocated, depending on 
whether acquisition mode or tracking mode is operative. Any unused memory is 
released to other processes. Similarly in tracking mode the integrated data stream is 
stored in tracking integration buffer 254. 

This method has been tested with up to 4 symbols, without an IIR filter, and it has 
been found that the spikes can be recovered. However this approach does require 
increased memory. 
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FFT Processor 



The discrete Fourier transform ("DFT") has the well known formula 



x(k) 
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k = 0,1 f .. M N-1 



(25) 
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where 



e N = the number of points in the DFT; 
x(k) = the kth output in the frequency domain; 
x(n) = the nth input in the time domain 



and 
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nk = o -j(2nnk/L) 



(26) 



is also known as a "twiddle factor". 
For N > 1000 the DFT imposes a heavy computational burden and becomes 
impractical. Instead the continuous Fourier transform is used, given by 



The continuous Fourier transform, when computed according to the well known FFT 

20 " algorithm, breaks the original N-point sequence into two shorter sequences. In the 
present invention the FFT is implemented using the basic butterfly unit 258 as shown 
in Fig. 24. The outputs C and D represent equations of the form C = A + B, and D = (A - 
B)W\ The. butterfly unit 258 exploits the fact that the powers of W are really just 
complex additions or subtractions. 

25 ;rA real-time FFT processor, realized as the FFT calculation circuitry 1 68 (Fig. 14) 

is a key component in the implementation of the multicarrier digital receiver 126 (Fig. 
12). Known 8K pipeline FFT chips have been implemented with 1.5M transistors, 
requiring an area of 100 mm 2 in 0.5p technology, based on the architecture of Bi and 
Jones. Even using a memory implementation with 3-transistor digital delay line 

30 techniques, over 1M transistors are needed. This has been further reduced with 
alternative architecture to 0.6M, as reported in the document A New Approach to 
Pipeline FFT Processor Shousheng He and Mats Torkelson, Teracom Svensk 
RundRadio. DTTV-SA 180, TM 1547. This document proposes a hardware-oriented 
radix-2 2 algorithm, having radix-4 multiplicative complexity. However the requirements 

35 of the FFT computation in the present invention require the implementation of a radix 
2 2 +2 FFT processor. 
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Referring to Fig. 25 and Fig. 26 the butterfly structures BF2I 260 and BF2II 262 t 
known from the noted Torkelson publication, are shown. The butterfly structure BF2II 
262 differs from the butterfly structure BF2I 260 in that it has logic 264 and has a 
crossover 266 for crossing the real and imaginary inputs to facilitate multiplication by -j. 
5 Fig. 27 illustrates the retimed architecture of a radix 2 2 + 2 FFT processor 268 in 

accordance with the invention, which is fully pipelined, and comprises a plurality of 
stages, stage-0 270 through stage-6 272. Except for stage-0 270, the stages each 
comprise one butterfly structure BF2I 260 and one butterfly structure BF2II 262, and 
storage RAMS 274, 276 associated therewith, stage-0 270 only has a single butterfly 

10 structure BF2I 260. This architecture performs a straight-forward 32-point FFT. stage-6 
272 has control logic associated therewith, including demultiplexer 278 and multiplexer 
280, allowing stage-6 272 to be bypassed, thus providing a 2K implementation of the 
FFT. Counters 282 configure the butterfly structures BF2I 260 and BF2II 262 to select 
one of the two possible diagonal computations, during which data is being simulta- 

1 5 neously written to and read from the storage RAMS 274, 276. 

Fig. 28 illustrates a 32 point flow graph of the FFT processor 268 using radix 2 2 +2 
pipeline architecture. Computations are performed using eight 4-point FFTs and four 8- 
point FFTs. These are decomposed in turn into two 4-point FFTs and four 2-point FFTs. 
Fig. 29 illustrates the retimed architecture of a configurable 2K/8K radix 2 2 +2 

20 single path, delay feedback pipelined FFT proceissor 284, in which like elements in Fig. 
27 are given the same reference numerals. The stages have a plurality of pipeline 
registers 286 which are required for proper timing. of the butterfly structures BF2I 260 
and BF2II 262 in the various stages. As can. be seen, the addition of each pipelined 
stage multiplies the range of the FFT by, a factor of 4. There are 6 complex multipliers 

25 288, 290, 292, 294, 296, 298 which operate in parallel. This processor computes one . 
pair of l/Q data points every four fast clock.cycles, which isequivalent to the sample rate 
clock. Using 0.35pm technology the worst case throughput is 140ps for the 2K mode of 
/ operationrand 550ps for the 8K mode, exceeding the requirements of the ETS 300 744 
telecommunications standard. Data enters the pipeline from the left side of Fig. 29, and 

30 emerges on the right. The intermediate storage requirements are 2K/.8K for I data and 
2K/8K for Q data, and is mode dependent. In practice the radix-4 stage, is. implemented 
as a cascade of two adapted radix-2 stages that exploit the radix-4 algorithms to reduce 
the number of required complex multipliers. 

Fig. 30 is a schematic of one embodiment of the multipliers 288, 290, 292, 294, 

35 296, 298 for performing the complex multiplication C = A x B; where A is data, and B is 
a coefficient. Because the FFT processor 284 has 6 complex multipliers, each requiring 
3 hardware multipliers 300, a total of 18 hardware multipliers 300 would be required. It 
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is preferable to use the embodiment of Fig. 31 in which some of the hardware multipliers 
300 are replaced by multiplexers 302, 304. 

Turning again to Fig. 29 there are a plurality of RAMS 306, 308, 310, 312, 314, 
316 which are preferably realized as ROMs and contain lookup tables containing 
5 complex coefficients comprising cosines for the multipliers 288, 290, 292, 294, 296, 298 
respectively. It has been discovered that by addressing the RAMS 306, 308, 310, 312, 
31 4, 31 6 according to a particular addressing scheme, the size of these RAMS can be 
markedly reduced. The tradeoff between the complexity of the addressing circuitry and 
the reduction in RAM size becomes favorable beginning at stage-3 318. Referring again 

1 0 to Fig. 28 there are two columns 320, 322. Column 320 holds values W 2 - W 14 ,. followed 
by W 1 - W 7 , and then W 3 - W 21 . These coefficients are stored in the RAM 308, required 
by the particular multiplier 290. Column 322 contains values W 8 , W 4 , W 12 , which repeat 
3 times. Note further that between the values W 8 , W 4 , and W 4 , W 12 are connections 
324, 326 to the preceding butterfly unit located in column 328. In practice the - 

15 connections 324, 326 are implemented as multiplications by W°. In moving from 
multiplier to multiplier toward the left in Fig: 29, the lookup table space is multiplied by 
a power of 4 at each stage. In Fig. 32 table 330, the lookup table for multiplier M 3 
contains 512 entries. It can be deduced by extrapolation that multiplier M 5 must contain 
8192 twiddle factors, and corresponds to the size of the FFT being performed by the 

20 FFT processor 284 (Fig. 29). 

Before examining the look-up table space in more detail it is helpful to consider the 
plurality of horizontal lines 332. Moving downward from the top of Fig. 28, the line 
beginning at x(3) extends to W 8 , which is the first twiddle factor required, and is at the 
third effective step in the flow diagram. Figs. 33 and 32 show the organization of the 

25 twiddle factors for each of the multipliers, wherein the terminology M k represents the 
multiplier associated with the kth stage. Thus table 334 relates to multiplier M 0 . The 
notation for the W values (twiddle factors) is shown in box 336. The subscript "B" at the 
bottom right represents a time stamp, that is an order dependency in which the twiddle 
factors are required by the pipeline. The superscript "A" represents the address of the 

30 twiddle factor in its lookup table. The superscript "N" is the index of the twiddle factor. 

Thus in table 334 it may be seen that W° is required at time 0, W 1 at time 1 , and 
W° is again required at time 2. Further inspection of the other tables in Figs. 33, 32 
reveals that half of the entries in each table are redundant. The storage requirement for 
the lookup tables can be decreased by 50% by eliminating redundant entries. This has 
. 35 been accomplished by organizing the W values in ascending order by index, so that the 
values can be stored in memory in ascending order. Thus in the case of table 338 the 
index values range from 0 to 21, with gaps at 11, 13, 16, 17, 19, and 20. 
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The procedure for organizing the lookup table and the addressing scheme for 
accessing the twiddle factors is explained with reference to table 338, but is applicable 
to the other tables in Fig. 33. (1) Each row is assigned a line number as illustrated. (2) 
Each twiddle factor is assigned an order dependency which is noted in the lower right 
5 of its respective cell in table 338. (3) It is assumed that table 338 in its reduced form will 
contain only unique twiddle factors in ascending order by index within the memory 
address space. Consequently each twiddle factor is assigned a memory address as 
shown in the upper left of its respective cell. 

During address generation, for line 3 of table 338 the address is simply held at 0. 
1 0 For line 1 the address is incremented by 1 to the end of the line. However lines 0 and 
2 contain non-trivial address sequences. For line 0, looking at table 340, which contains 
64 values, it will be observed that the address sequence changes according to the 
intervaIs-2,2,2,2, and then later 1 ,1 ,2,1 ,1 ,2 ... For line 2, the address first increments by 
3, then by : 2, and finally by 1 . The locations at which the address increments change are 
1 5 referred to herein as the "break-points". These values of the break points range between" 
0, corresponding to the first point in line 2, to the last position in the line. 

By inspection it can be seen that the occurrence of the first break point changes' 
from table to table following the recurrence relationship 
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B1 m n =4B1 Mni (28) 

with the initial condition 

81 m 0 = 1 (29) 
where M N is the multiplier of the Nth stage of the FFT processor 284. 

Expanding the recurrence relationship gives: ' v 

B1 Mn ='(((4Bl Mo -1)x4-1)x4-1) .:. (30) 

30 B1 m n " 4 N B1 Mo -4 N - 3 -4 N - 2 ;..-4° - (31) 

B1 Mn = 4 n B1 Mn -]tV (32) 

n=0 
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Similarly the second break point B2 for line 2 is determined from the recurrence relation 

B2 m n =4B2 m n ., * 1 (33) 

with the initial condition 
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B2 Mq = 1 (34) 



or 

B2 Mm = (((4B2 Mo + 1)x4 + 1)x4*1) ... (35 ) 

5 

B2 Mn = E4" (36) 

Break point B3 for line 0 at which the sequence changes from increments of 
1 0 2,2,2,2 to the pattern 1 ,1 ,2,1 ,1 ,2 . . . can be located by inspecting tables 338, 340, and 
330f l ln table 338 the break point B3 occurs very late in the line, such that the second 
- sequence only presents its first two elements. By examining the address locations in the 
larger noted tables, it can be deduced that the location of break point B3 is related to 
the number of entries in a particular table as 



B3 = — + 2 (37) 



where K is the number of table entries. In the tables in Fig. 29 K = 8, 32, 128, 2048, 
20 8192. Therefore, in terms of the N'th complex multiplier, break point B3 can be 
expressed as 

B3 Mn = 2 x 4 N + 2 (38) 

where N > 0. 

25 M Address generators 342, 344, 346, 348 are operative for the lookup tables in 

RAMS 310, 312, 314, 316. Silicon area savings for the smaller tables 308, 306 are too 
small to make this scheme worthwhile. 

Fig. 34 schematically illustrates an address generator 342 for the above described 
address generation scheme, and is specific for the table 340 and multiplier M 2 . 128 

30 possible input states are accepted in lines in_Addr 350, and a multiplexer 352 selects 
the two most significant bits to decode 1 of 4 values. The output of the multiplexer 352 
relates to the line number of the input state. Actually the output is the address increment 
applicable to the line number of the input state, and is used to control a counter 354 
whose incremental address changes according to value on line 356. Thus, the 

35 increment for line 3 of table 340 is provided to the multiplexer 352 on line 358, and has 
a value of zero, as was explained above. Similarly the increment for line 1 of table 340 
is provided to the multiplexer 352 on line 360, and has a value of 1 . 
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The situations of line 0 and line 2 are more complicated. For line 0 the output of 
decoding logic 362 is provided by multiplexer 364, and has either an incremental value 
of 2, or the output of multiplexer 366. The latter could be either 1 or 2, depending on the 
state of a two bit counter 368, which feeds back a value of 0 or 1 as signal count 370. 
5 Decoding logic 372 decodes the states for line 2 of table 340. The relationship of 

the current input state to the two break points of line 2 are tested by comparators 374, 
376. The break point is actually set one sample earlier than the comparator output to 
allow for retiming. The outputs of the comparators 374, 376 are selectors for the 
< multiplexers 378, 380 respectively. 
10 The current address, held in accumulator 382 is incremented by the output of the 

multiplexer 352 by the adder 384. A simple logic circuit 386 resets the outgoing address, 
which is contained in register ACC 388, by asserting the signal rst 390 upon completion 
of each line of table 340. This insures that at the start of the next line the address points 
to twiddle factor W°. The new address is output on the 6 bit bus out_Address 392, which 
1 5 is one bit smaller than the input in_Addr 350. 

Fig. 35 is a generalization of address generator 342 (Fig. 34), in which the 
incoming address has a path of B bits. Like elements in Figs. 34 and 35 are given the 
same reference numerals. The structure of address generator 394 is similar tathat of 
- the address generator 342, except now the various lines of the input in_addr 396 and 
20 the output out_addr[B-2:0] 398 are denoted in terms of B. Thus the multiplexer 352 in 
Fig. 35 is selected by input in_addr [B-1:B-2] 400 . Similarly one of the inputs of 
comparator 374 and of comparator 376 is in_addr [B-3:0] 402. Out_addr[B-2:0] 398 
forms the output. The advantage of this structure is a reduction in the size of the lookup 
table RAM of 50%. 

25 . The FFT calculation circuitry 168 (Fig. 14) is disclosed in Verilog code listings 1 - . 

17. The Verilog code for the address generator, 394 is generic, enabling any power-of- 
four table to be implemented. 
Channel Estimation and Correction 

The function of the Channel estimation and correction circuitry shown in channel 

30 estimation and correction block 170 (Fig. 14) is to estimate the frequency .response of 
the channel' based on the received values of the continuous and scattered pilots 
specified in the ETS 300 744 telecommunications standard, and generate compensation 
coefficients which correct for the channel effects and thus reconstruct the transmitted 
spectrum. A more detailed block diagram of the channel estimation and correction block 

35 170 is shown in Fig. 16. 

In acquisition mode, the channel estimation and correction block 170 needs to 
locate the pilots before any channel estimation can take place. The circuitry performs 
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a convolution across the 2048 carriers to locate the positions of the scattered pilots, 
which are always evenly spaced, 1 2 carriers apart. Having found the scattered pilots, 
the continual pilots can be located; once this is done the exact position of the 1705 
active carriers within the 2048 outputs of the FFT calculation circuitry 168 (Fig. 14) is 
known. A timing generator 404 within the block can then be initialized, which then 
generates reference timing pulses to locate pilots for channel estimation calculation and 
for use in other functions of the demodulator as well. 

Channel estimation is performed by using the evenly spaced scattered pilots, and 
then interpolating between them to generate the frequency response of the channel. 
The received carriers (pilots and data) are complex divided by the interpolated channel 
response to produced a corrected spectrum. A complete symbol is held in a buffer 406. 
This corrects for the bit-reversed order of the data received from the FFT calculation 
circuitry 168. It should be noted that raw, uncorrected data is required by the frequency 
and sampling rate error circuitry. 

The task of synchronizing to the OFDM symbol in the frequency domain data 
received from the FFT calculation circuitry 168 (Fig. 14) begins with the localization of 
the scattered and continual pilots, which occurs in pilot locate block 408. Scattered 
pilots, which according to the ETS 300 744 telecommunications standard, occur every 
1 2 data samples, offset by 3 samples with respect to the start of the frame in each 
succeeding frame. As the power of the pilot carriers is 4/3 the maximum power of any 
data carrier, a succession of correlations are performed using sets of carriers spaced 
at intervals of 12. One of the 12 possible sets is correlates highly with the boosted pilot 
carrier power. 

A first embodiment of the pilot search procedure is now disclosed with reference 
to Figs. 36 and 1 6. It should be noted that the scattered pilot search procedure is done 
on the fly, and storage is only required in so far as is necessary to perform the 
subsequent step of continual pilot location discussed below. At step 410, after the 
assertion of the signal resync 204, generally occurring after a channel change or on 
power up, the signal pilotjock 41 2 is set low. Then, at step 41 4 the process awaits the 
first symbol pulse from the FFT calculation circuitry 168 (Fig. 14) on line 416 indicating 
the start of the first symbol. The first symbol is received and stored. In one embodiment 
of the pilot search procedure each point from 0 to 2047 is read in turn, accumulating 
each value (|l| + |Q|) in one of 12 accumulators (not shown). The accumulators are 
selected in turn in a cycle of 12, thus convolving possible scattered pilot positions. Two 
well known peak trackers indicate the accumulator with highest value (Peakl ) and the 
accumulator having the second" highest value (Peak2). The accumulator having the 
highest value corresponds to the scattered pilot orientation. The second highest value 
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is tracked so that the difference between the highest peak and the second highest peak 
can be used as a "quality" measure. At decision step 418, if the two peaks are not far 
enough apart, a test for completion of a full range frequency sweep is made at decision 
step 420. If the test fails, failure of the scattered pilot search is reported at step 422. 
Otherwise, at step 424 the IQ Demodulator LO frequency is incremented by +1/8 carrier 
spacing by incrementing the magnitude of the control signal freq_sweep 426. Then the 
search for scattered pilots is repeated after delaying 3 symbols at step 428 to allow time 
for the effect of the change to propagate through the FFT calculation circuitry 168 and 
buffers. The peak difference threshold can be altered by the control microprocessor via 
the microprocessor interface 1 42 and block 430. 

In a variation of the first embodiment there is only a single peak tracker which 
indicates the accumulator with highest value, which corresponds to the scattered pilot 
orientation. The' true scattered pilot orientation thus found is one of 12 possible 
orientations. 

If the test at decision step 418 is successful; the search for continual pilots is 
begun at step 432 by establishing an initial pilot offset from the 0 location in the RAM] 
storing the FFT data, according to the formula 

pilot offset = (accumulator # mod 3) (39) 

Thus, if the scattered pilot peak is in accumulator 0, 3, 6 or 9 the pilot offset is 0. If the 
scattered pilot peak is in accumulator 1, 4, 7, or 10 then pilot offset is 1, etc. Then 45 
carrier positions expected for continual pilots are read, adding the pilot offset value to 
the address, and accumulating + |q|) values. This procedure is repeated until first 
115 continual pilot start positions have-been searched. From the ETS 300 744 tele- 
communications standard the number of possible first carrier positions among the active" 
carriers lying in a contiguous block between carrier 0 and carrier 2047 is easily 
calculated as (2048-1705) / 3 *1 15, as explained below. It is thus guaranteed that the 
active interval begins within the first (2048-1705) carrier positions. The carrier 
corresponding to the peak value stored is the first" active carrier in the symbol. 

/Upon completion of the continual pilot search, at step 434 the timing generator 404 
is reset to synchronize to the first active carrier and scattered pilot phase. The signal 
pilotjock 412 is then set high at step 436, indicating that the pilots have been located 
successfully, then at step 436 the timing generator 404 is reset to synchronize to the 
first active carrier and scattered pilot phase. 

In a tracking mode of operation, shown as step 438, the scattered pilot search is 
repeated periodically, and evaluated at decision step 440. This can be done at each 
symbol, or less frequently, depending upon propagation conditions. The predicted 
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movement of the scattered pilot correlation peak is reflected by appropriate timing in the 
timing generator 404, and can be used as a test that timing has remained synchronized. 
Failure of the test at decision step 440 is reported at step 442, and the signal pilotjock 
412 is set low. 

5 A second embodiment of the pilot search procedure is now disclosed with 

reference, to Figs. 16 and 37. At step 444 the assertion of the signal resync 204 ( 
generally occurring after a channel change or on power up, the signal pilotjock 412 is 
set low. Then, at step 446 a symbol is accepted for evaluation. A search for scattered 
pilots, conducted according to any of the procedures explained above, is performed at 
10 step 448. Then a search for continual pilots is performed as described above at step 
450: s At decision step 452 it is determined whether two symbols have been processed. 
If thttest fails, control returns to step 446 and another symbol is processed. If the test 
succeeds at step 454 another test is made for consistency in the positions of the 
scattered and continual pilots in the two symbols. If the test at step 454 fails, then the 
15 procedure beginning with decision step 420 is performed in the same manner as 
previously described with reference to Fig. 36. If the test at step 454 succeeds at step 
.456 the timing generator 404 is reset to synchronize to the first active carrier and 
scattered pilot phase. The signal pilotjock 41 2 is then set high at step 458, indicating 
that the pilots have been located successfully. 
20 In a tracking mode of operation, shown as step 460, the scattered pilot search is 

repeated periodically, and evaluated at decision step 462. This can be done at each 
cycle of operation, or less frequently, depending upon propagation conditions. The 
predicted movement of the scattered pilot correlation peak is reflected by appropriate 
timing in the timing generator 404, and can be used as a test that timing has remained 
25 synchronized. Failure of the test at decision step 462 is reported at step 464, and the 
signal pilotjock 412 is set low. 

It will be appreciated that after the scattered pilots have been located, the task of 
locating the continual pilots is simplified considerably. As the continual -pilots are 
inserted at a known sequence of positions, the first of which-is offset by a multiple of 3 
30 positions with respect to start of the frame, as specified by the ETS 300 744 telecommu- 
nications standard. Two of three possible location sets in the data space can therefore 
be immediately excluded,~and it is only necessary to search the third set. Accordingly 
the continual pilot search is repeated, each iteration beginning at a location 3 carriers 
higher. New accumulated values and the current start location are stored if they are 
35 larger than the previous accumulated value. This is repeated until all continual pilot start 
positions have been searched. The carrier corresponding to the largest peak value 
stored will be the first active carrier in the symbol. It is unnecessary to evaluate the 
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"quality" of the continual pilot correlation peak. The scattered pilot search represents a 
correlation of 142 samples, and has higher noise immunity that of the search for 45 
continual pilots. The continual pilot search is almost certain to be succeed if scattered 
pilot search completed successfully, 
5 The above sequences locate scattered pilot positions within 1/4 symbol period, 

assuming accumulation at 40MHz, and locate continual pilots in less than 1 symbol 
period (45 x 1 15 clock cycles assuming 40MHz operation). 
■ ■ . - The I and.Q data is provided to the pilot locate block 408 by the FFT calculation 
circuitry 168 (Fig. 14) in. bit-reversed order on line 416. This complicates the problem of 

10 utilizing a minimum amount of RAM while computing the correlations during pilot 
localization. Incoming addresses are therefore bit reversed, and computed modulo 12 
in order to determine which of 12 possible bins is to store the. data. Jn order to avoid the 
square root function needed to approximate the carrier amplitude, the absolute values 
of the data are summed instead as a practical approximation. The scattered pilots are 

15 • determined M on the fly". The continual pilots are located on frames which succeed the, 
frames in which the scattered pilots were located. ± 
• . The operation of the timing generator 404 is now disclosed in further detail. The, 
addressing sequence for the RAM buffer 406 is synchronized by a symbol pulse from 
the FFT calculation circuitry 168 (Fig. 14). The FFT calculation process runs cbntinu- 

20 ously once the first symbol from has been received following FFT Window acquisition. 
Addressing alternates between bit-reversed and linear addressing for successive 
symbols. The timing generator 404 also generates all read-write timing pulses.- 

Signals u_symbol 466 and c_symbol 468 are symbol timing pulses indicating the 
start of a new uncorrected symbol or corrected symbol. The signal u_symbol 466 is 

25* delayed by latency of the interpolating filter 470 and the complex.multiplier 472, which, 
are synchronized to RAM Address Sequence Timing. 

- For carrier timing the signals c_carrier0 474, pilot timing signals us_pilot(+) 476, 
uc_pilot(+) 478, c_tps_pilot(*) 480 and odd_symbol pulse 482 are referenced to a 
common start pulse sequence. A base timing counter (not shown) is synchronized by 

30 the pilot locate sync timing pulse 484, and is therefore offset from symbol timing. Pilot 
timing outputs are also synchronized to uncorrected symbol output from the buffer 406 
or the corrected symbol output delayed by the interpolating filter 470 and the complex 
multiplier 472. On assertion of the signal resync 204 all timing output is set to inactive 
states until the first symbol is received. Let the transmitted pilot at carrier k be P k and 

35 the received pilot be P' k . 
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P k ' = H k • w k • P k (40) 

where P k is described below, and 

P'k = "k + )Qk (41) 

where k indexes pilot carriers, H k is the channel response and w k is the reference 
sequence. We interpolate H k to generate compensation values for the received data 
carriers, D' k : 

Dk = l k + j<V (42) 

D k = -*i (43) 
H k 

15 . where k indexes data carriers. Received pilots can be demodulated using a locally 
generated reference sequence and are then passed to the interpolating filter. 

The interpolating filter 470, realized in this embodiment with 6 taps and 12 
'coefficients, is utilized to estimate the portion of the channel between the scattered 
pilots. As explained above pilots are transmitted at known power levels relative to the 

20 data carriers and are modulated by a known reference sequence according to the ETS, 
300 744 telecommunications standard'. The transmitted pilot carrier amplitudes are ± 4/3 
of nominal data carrier power (44/3 for reference bit of 1, -4/3 for the reference bit of 0; 
quadrature component = 0 in both cases). Interpolation coefficients are selected from 
the 0-1 1 cyclic count in the timing generator 404 synchronized to data availability. 

25 Appropriate correction factors may be selected for data points to provide on-the-fly ; 
correction. The coefficients vary depending on scattered pilot phase. Since the positions 
of reference pilots vary, therefore coefficients to compensate a given data carrier also 
vary. 

The input and output signals, and signals relating to the microprocessor interface 
30 142 of the channel estimation and correction block 170 are described in tables 18, 19 
and 20 respectively. The circuitry of the channel estimation and correction block 170 is 
disclosed in Verilog code listings 1 8, and 1 9. 
TPS Sequence Extract 

The tps sequence extract block 172 (Fig. 14), although set out as a separate block 
35 for clarity of presentation, is in actuality partially included in the channel estimation and 
correction block 170. It recovers the 68-bit TPS data carried in a 68-symbol OFDM 
frame, and is shown in further detail in Fig. 38. Each bit is repeated on 17 differential 
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binary phase shift keyed ("DBPSK") modulated carriers, the tps pilots, within a COFDM 
symbol to provide a highly robust transport channel. The 68-bit tps sequence includes 
14 parity bits generated by a BCH code, which is specified in the ETS 300 744 
telecommunications standard. Of course appropriate modifications can be made by 
5 those skilled in the art for other standards having different BCH encoding, and for 
modes other than 2K mode. 

A clipper 486 clips incoming corrected spectrum data to ±1. The sign bit can be 
optionally evaluated to obtain the clipped result. In comparison block 488 clipped 
received tps pilot symbols are compared against a reference sequence input. In the 
10 described embodiment a value of 0 in the reference sequence matches -1 in the pilot, 
and a value of 1 in the reference sequence matches +1 in the pilot. Majority vote 
comparisons are used to provide an overall +1 or -1 result . A result of +1 implies the 
same modulation as the reference sequence, and a result of -1 implies inverse 
modulation. 

15 * The DBPSK demodulator 490 converts the +/-1 sequence from the majority vote- 
;form to a binary form. The sequence converts to a value of 0 if the modulation in current' 
and previous symbols was the same, and to 1 if modulation between successive- 1 
symbols is inverted. * ; 
From an uninitialized condition a search for either of two sync words in 68-bit tps 
-20 - -sequence (4 x 68-bit = 1 superframe)-is-conducted in the frame synchronizer block 492. 
The synchronization words of a superframe are as follows: 

0011010111101110 sync word for frames 1 and 3 

1100101000010001 sync word for frames 2 and 4 : - 

Having acquired either sync word, a search for the other is conducted in the appropriate 
25 position in the next OFDM frame. On finding the second sync word synchronization is 
declared by raising the signal tps_sync 494. Data is then passed to the BCH decoder 
496, which operates on 14 parity bits at the end of an OFDM frame against received 
data in the frame. Errors are corrected as necessary. 

Decoded data is provided to output store block 498, which stores tps data that is 
30 found in a full OFDM frame. The output store block 498 is updated only at the end of an 
OFDM frame. Only 30 bits of interest are made available. Presently some of these bits 
are reserved for future use. The length indicator is not retained; 

The BCH decoder 496 has been implemented in a manner that avoids the 
necessity of performing the Berlekamp Algorithm and Chien Search which are 
35 conventional in BCH decoding. The Galois Field Multiplier used in the BCH decoder 496 
is an improvement of the Galois Field Multiplier which is disclosed in our copending U.S. 
Application No. 08/801,544. 
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The particular BCH code protecting the tps sequence is specified imthe ETS 300 
744 telecommunications standard as BCH (67,53.t=2), having a code generator 
polynomial 

h(x) = x 14 +x 9 +x 8 +x 6 +x 5 +x 4 +x 2 +x+1 (44) 

or equivalently 

h(x) = (x 7 +x 3 + 1) (x 7 +x 3 +x 2 +x + 1) (45) 

The left factor is used to generate the Galois Field which is needed for error detection. 
Referring to Fig. 39, this is calculated in syndrome calculation block 500 which can be 
implemented using a conventional feedback shift register to generate the a values. The 
first three syndromes are then computed by dividing the received signal R(x) by the 
values a 1 , a 2 , and a 3 , again using a conventional feedback shift register implementa- 1 
tion, as is well known in the art of BCH decoding. It can be shown that the syndromes 
are 



S 0 


= (a 1 ) e ° 


+ (a 1 ) 6 ' 


(46) 


s. 


= (a 2 ) e ° 


+ (a 2 ) 61 


(47) 


s 2 


- (a 3 ) e ° 


+ (a 3 ) 6 ' 


(48) 



During the syndrome computation the syndromes are stored in storage registers 
R[2:0] 502. 

Jn the event S 0 is 0, then it can be immediately concluded that there are no errors 
in the current tps sequence, and a signal is asserted on line 504 which is provided to, 
error detect block 506, and the data of the received signal R(x) either output unchanged 
or toggled according to the output of the error detect block 506 on line 508. As explained 
below, if 

3,0 30 = S 2 (49) 

then exactly one error is present, a condition which is communicated to the error detect 
block 506 on line 510. Otherwise it is assumed that two errors are present. More than 
two errors cannot be detected in the present implementation. 

In order to solve the system of three non-linear equations shown above, data flow 
from the registers R[2:0] 502 into search block 512 is enabled by a signal EOF 514,= 
indicating the end of a frame. Three feedback shift registers 516, 518, 520 having 
respective Galois Field multipliers 522, 524, 526 for a" 1 - a' 3 in the feedback loop are 
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initialized to 50H, 20H, and 3dH (wherein the notation "H" refers to ^hexadecimal 
numbers). The feedback shift registers 516, 518, 520 are clocked each time a new data 
bit is available. The syndromes and outputs of the feedback shift registers 51 6, 51 8, 520 
are clocked into to a search module, which performs a search for the error positions 
5 using an iterative substitution search technique, which will now be described. The 
outputs of feedback shift registers. 5 16, 518 are multiplied in a Galois Field Multiplier 
528. 

Considering the case of one error, S 0 is added, modulo 2, preferably using a 
network of XOR gates 530 f to the output of the first feedback shift register 516 (a-gen 0 ). 
10 If the relationship 

( S 0^gen 0 ) =0 (50)" 

holds, it is concluded that there is an error in the present data bit. The bit being. currently 
output from the frame store is toggled. The search is halted, and the data is output from 
15 the frame store. 

Considering the case of two errorsrif the following relationship holds, there is an 
error in the current bit being output from the frame store: 

(3 0 + cc geno ) O (S 1+ a geni ) = (S 2 + a gena ) (51) 

20 ^ It is now necessary to store the three terms calculated in the immediately preceding.. 

equation into the registers R[2:0] 502 which previously stored the syndromes S 0 - S 2 . 

This is represented by line 532. 

The process continues, now looking for the second error, and reusing- the data in 

registers R[2:0] 502, which now contains the syndromes as adjusted by the previous 
25 iteration. The adjusted syndromes are denoted S 0 ' - S 2 '. 

S 0 =(S 0 + a geno ) .etc.. . (52)* 

Now, if 

30 (So' + <W =0 (53) 

the second error has been found, and the bit being currently output from the frame store 
is toggled by XOR gate 534. If the search fairs, more than two errors may be present 
and an error signal (not shown) is set. 

the Galois Field Multiplier 528 is a clocked digital circuit and is disclosed with 
35 reference to Fig. 40. The tps data is received very slowly, relative to the other processes 
occurring in the multicarrier digital receiver 126. It is thus possible to execute the 
iterative substitution search slowly, and the Galois Field Multipliers are designed for 
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minimum space utilization. They do not require alpha generators, but rely on small 
constant coefficient multipliers, with iterative feedback to produce the required alpha 
values. The arrangement takes advantage of the relationship in Galois Field arithmetic 

cc n = a 1 • a"' 1 (54) 

5 

After initialization by . a signal init 536 which selects multiplexers 538, 540, the 
multiplicand A 542 is accumulated in register 544 and repeatedly multiplied by the value 
a 1 in multiplier 546. The output on line 548 is repeatedly ANDed bitwise with the 
multiplicand B held in a shift register 550. The output of the shift register is provided on 

10 a one bit line 552 to the gate 554. The output of the gate 554 is accumulated in register 
556 t using the adder 558. 

..Jhe. input and output signals and signals relating to the microprocessor interface 
142 of the tps sequence extract block, 172 are described in tables 21, 22, and 23. 
Circuitry of the tps sequence extract block 172 and the BCH decoder 496 is disclosed 

15 in Verilog code listings 20 and 21. 

Automatic Fine Frequency Control and Automatic Sampling Rate Control 

A non ideal oscillator present in the transmission chain of an orthogonal frequency 
.division multiplexed ("OFDM") signal affects all carriers in the OFDM symbols. The 
OFDM carriers adopt the same phase and frequency disturbances resulting from the 

20 noisy local oscillator. Variations in the frequency of the Local Oscillator lead to phase 
shifts, and consequent loss of orthogonality within the OFDM symbol. Therefore 
competent automatic frequency control is required in the receiver to track the frequency 
offsets relative to the transmitter in order to minimize these phase 'shifts and hence 
maintain orthogonality. 

25 vAII the carriers within an OFDM symbol are equally affected by the phase shifts. 

Thisjs similar to the common phase error caused by phase noise. The common phase 
error present on all carriers is used to generate an Automatic Frequency Control ("AFC") 
signal, which is completely in the digital domain, since l/Q demodulation is performed 
in the digital domain. The approach taken is the calculation of the common phase error 

30 for every OFDM symbol. This is achieved by using the reference pilots. The change in 
the common phase error is measured over time to detect a frequency offset and is used 
to derive the AFC control signal. The generic approach for the AFC control loop and the 
automatic sampling rate control loop disclosed below is illustrated in Fig. 41 . 

Automatic sampling rate control is required when the receiver's master clock is not 

35 aligned with that of the transmitter. The misalignment causes two problems: (1 ) the 
demodulating carriers have incorrect spacing; and (2) the interval of the FFT calculation 
is also wrong. 
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The effect of this timing error is to introduce a phase slope onto the' demodulated 
OFDM data. This phase slope is proportional to the timing error. The phase slope can 
be determined by calculating the phase difference between successive OFDM symbols, 
using reference pilots, and estimating the slope of these phase differences. A least 
squares approach is used for line fitting. The ASC signal is low-pass filtered and fed 
back to the sine interpolator 158 (Fig. 13). v 
The mean phase difference between the reference pilots in subsequent OFDM 
symbols is used to calculate the frequency deviation. Assuming that the frequency 
"deviations of the local oscillator are constant, then the phase rotates with a, where 
a = 2nf d mT t rads. Here f d is frequency deviation, m is the number of symbols between 
repetitions of identical pilot positions, and T t is~the period comprising the sum of the 
active interval arid the guard interval. The AFC signal is generated overtime by low pass 
filtering a. The value of the frequency deviation is then used to control the IQ 
demodulator 144 (Fig. 13). 

The AFC and ASC control signals are effective only when a guard interval is 
passing indicated by the assertion of signal IQGI on line 154 (Fig. 13). This prevents a 
symbol -from being processed under two different conditions. * 

The correction circuitry 174 (Fig. 14) is shown in greater detail in Fig. 42. 
Frequency error values output on line 560 are calculated by determining the average of 
the f differences of phase values of corresponding pilots in a current symbol and the 
previous symbol. The resulting frequency error value is filtered in low pass filter 562 
before being fed-back to the IQ demodulator 144 (Fig. 13). It is optional to also evaluate 
coritinual pilbts in order to cope with larger frequency errors. Sampling rate error ( "output 
on line 564 is determined by looking at the phase difference between pilots in a symbol 
and the same pilots in a previous symbol. The differences vary across the symbol; 
giving a number of points through which a line can be fitted using the well known 
method of least squares regression. The slope of this line is indicative of the magnitude 
and direction of the sampling rate error. The sampling rate error derived in this way is 
filtered in low pass filter 566 before being fed back to the sine interpolator 158 (Fig. 13). - 

A separate store 568 for the scattered pilots contained in 4 symbols is shared by 
the frequency error section 570 and the sampling rate error section 572. Direct 
comparison of scattered pilot symbols is thereby facilitated, since the scattered pilot 
phase repeats every four symbols. In an alternate embodiment where scattered pilots 
are used to provide control information, storage must be provided for four symbols. In 
the preferred embodiment, wherein control information is derived from continual pilots, 
storage for only one symbol is needed. 
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Recovery of the angle of rotation a from the I and Q data is accomplished in the 
phase extract block 574, where 

a = tan" 1 (Q/l) (55) 

In the presently preferred embodiment, the computations are done at a resolution of 14 
bits. The phase extract block 574 is illustrated in greater detail in Fig. 43. The quadrant 
of a is first determined in block 576. The special cases where I or Q have a zero 
magnitude or I = Q is dealt with by the assertion of signals on lines 578. If the magnitude 
of Q exceeds that of I, quotient inversion is accomplished in block 580, utilizing a control 
signal 582. A positive integer division operation is performed in division block 584. 
Although this operation requires 11 clock cycles, there is more than enough time 
allocated for phase extraction to afford it. The calculation of the arctangent of the 
quotient is accomplished by a pipelined, truncated iterative calculation in block 586of the 
Taylor Series 

tan-(x) =x-*^-4r^---- W< 1 < 56 > 

\ . 3 5 7 y .. 

Block 586 is shown in greater detail in the schematic of Fig. 44. The value x 2 is 
calculated once in block 588 and stored for use in subsequent iterations. Powers of x 
are then iteratively computed using feedback line 590 and a multiplier 592. The divisions 
are calculated using a constant multiplier 594 in which the coefficients are hardwired. 
The sum is accumulated using adder/subtractor 596. The entire computation requires 

47 -'48 clock cycles at 40 MHz. 

- Turning again to Fig. 43, quadrant mapping, and the output of special cases is 
handled in block 598 under control of block 576. It may be noted that the square error 
of the result of the Taylor Expansion rises rapidly as a approaches 45 degrees, as 
shown in Fig. 45 and Fig. 46, which are plots of the square error at different values of 
a of the Taylor expansion to 32 and 31 terms respectively. The Taylor expansions to 31 
and 32 terms are averaged, with the result that the square error drops dramatically, as 
shown in Fig. 47. A memory (not shown) forjnolding intermediate values for the 
averaging calculation is provided in block 598. 

Constant Phase Error across all scattered Pilots is due to frequency offset at IQ 
Demodulator. Frequency Error can be defined as: 
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where a, m and T, have the same meanings as given above, a is determined by taking 
the average of the difference of phase values of corresponding pilots between the 
current symbol and a symbol delayed for m symbol periods. In the above equation, m 
= 1 in the case of continual pilots. This computation uses accumulation block 600 which 
5 accumulates the sum of the current symbol minus the symbol that preceded it by 4. 
Accumulation block 602 has an x multiplier, wherein x varies from 1 to a minimum of 
142 (in 2K mode according to the ETS 300 744 telecommunications standard). The low 
pass filters 562, 566 can be implemented as moving average filters having 10-20 taps. 
The data available from the accumulation block 602 is the accumulated total of pilot 
10 phases each sampled m symbols apart. The frequency error can be calculated from . 



j = Acc{new-old} 
(N)(2)nmT t 



err , K .wox -r ,(58) 



N = 142 in the case of scattered pilots, and 45 for continual pilots, assuming 2K 

15 mode of operation according to the ETS 300 744 telecommunications standard. The 
technique for determining sampling rate error is illustrated in Fig. 48, in which the phase 
differences of pilot carriers, computed from differences of every fourth symbol (S n - S n . 4) 
are plotted against frequency of the carriers. The line of best fit 604 is indicated. A slope 
of 0 would indicate no sampling rate error. 

20 : ' Upon receipt of control signal 606 from the pilot locate block 408 (Fig. 14), a 
frequency sweep is initiated by block 608, which inserts an offset into the low-pass 
filtered frequency error output using adder 610. Similarly a frequency sweep i t s initiated 
by block 61 2/ which inserts an offset into the low-pass filtered sampling rate error output 
using adder 614. The frequency sweeps are linear in increments of 1/8 of the carrier 

25 spacing steps, from 0 - 3.5kHz corresponding to control signal values of 0x0-0x7. 

A preferred embodiment of the correction circuitry 174 (Fig. 14) is shown in greater 
detail in Fig. 49, Continual pilots rather than scattered pilots are held in a memory store 
616 at a resolution of 14 bits. The generation of the multiplier x for the computation in 
the accumulation block 618 is more complicated, since in accordance with the noted 

30 ETS 300 744 telecommunications standard, the continual pilots are not evenly spaced 
as are the scattered pilots. However, it is now only necessary to evaluate 45 continual 
pilots (in 2K mode according to the ETS 300 744 telecommunications standard). In this 
embodiment only the continual pilots of one symbol need be stored in the store 616. 
Inclusion of the guard interval size, is necessary to calculate the total duration of the 

35 symbol T t , is received from the FFT window circuitry (block 166, Fig. 14) on line 620. 
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The input and output signals and signals relating to the microprocessor interface 
142 of the circuitry illustrated in Fig. 42 are described in tables 24, 25, 26, and Table 27 
respectively. The circuitry is further disclosed in Verilog code listings 24 - 35. 
Demapper 

5 .The demapping circuitry 176 (Fig. 1 5) is shown as a separate block for clarity, but 

in practice is integrated into the channel estimation and correction circuitry. It converts 
I and Q data, each at 12-bit resolution into a demapped 12,bit coded constellation 
format (3-bit I, I soft-bit, 3-bit Q, Q soft-bit). The coded constellation is illustrated in Fig. 
50 and Fig. 51. For 64-QAM the 3 bits are used for the I and Q values, 2 bits for 

10 16-QAM 2-bits and 1 bit for QPSK. 

-for example in Fig. 51 values of l= 6.2, Q= -3.7 would be demapped to: l-data = 
001 ; l;soft-bit=01 1 ; Q-data=1 01 ; Q soft-bit=1 01 . 

The input and output signals of the demapping circuitry 1 76 are described in tables 
28 and 29 respectively. 

15 Symbol Deinterleaver 

The symbol deinterleaver 1 82 (Fig. 1 5) reverses the process of symbol interleaving 
of the transmitted signal: As shown in Fig. 52 the deinterleaver requires a 1512 x 13 
memory store, indicated as block 622. The address generator 624 generates addresses 
to. write in interleaved data and read out data in linear; sequence. In practice the address 

20 generator 624 is realized as a read address generator and a separate write address, 
generator. Reading and writing occur at different instantaneous rates in order to reduce 
the burstiness of the data flow. The address generator 624 is resynchronized for each 
new COFDM symbol by a symbol timing pulse 626. Carrier of index 0 is marked by 
carrierO pulse 628. Addresses should be generated relative to the address in which this 

25 carrier is stored. 

The input and output signals of the symbol deinterleaver 182 are described in 
tables 30 and 31 respectively. Circuitry of the symbol deinterleaver 1 82 is disclosed in 
Verilog code listing 22. 
Bit Deinterleaver 

30 Referring to Fig. 54. the bit deinterleaver 1 84 (Fig. 1 5) reverses the process of bit- 

wise interleaving of the transmitted signal, and is shown further detail in Fig. 53. In soft 
encoding circuitry 630 input data is reformatted from the coded constellation format to 
a 24 bit soft l/Q format. The soft encoding circuitry 630 is disclosed for clarity with the 
bit deinterleaver 184, but is realized as part of the symbol deinterleaver discussed 

35 above. The deinterleave address generator 632 generates addresses to read the 6 
appropriate soft-bits from the 126 x 24 memory store 634, following the address 
algorithm in the ETS 300 744 telecommunications standard. The deinterleave address 
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generator 632 is resynchronized for each new COFDM symbol by the^symbol timing 
pulse 626. 

The output interface 636 assembles I and Q output data streams from soft-bits 
read from the memory store 634. Three I soft bits and three Q soft bits are extracted 
from the memory store 634 at each deinterleave operation, and are parallel-serial 
converted to provide the input data stream to the Viterbi Decoder 186 (Fig. 15). 

* The input and output signals of the bit deinterleaver 184 are described in tables 
32 and 33 respectively. Circuitry of the bit deinterleaver 1 84 is disclosed in Verilog code 
listing 23. • - ■ ■ - 
Host Microprocessor Interface 

The function of the microprocessor interface 1 42 is to allow a host microprocessor 
to access. control and status- information within the multicarrier digital receiver 126 (Fig. 
12). The microprocessor interface 142 is shown in greater detail in Fig. 55. A serial 
interface 638 and a parallel interface 640 are provided, the latter being primarily of value 
for testing and debugging. The serial interface 638 is of known type and is I2C 
compatible.' The microprocessor interface 142 includes a maskable interrupt capability 
allovving the receiver to be configured to request processor intervention depending on 
intemalconditions. It should be noted, that the multicarrier digital receiver 126 does hot 
depend on intervention of the microprocessor interface 142 for any part of its normal 
operation. 

. The use of interrupts from the point of view of the host processor is now described. 
"Event" is the term used to-describe an on-chip condition -that a user might want to 
observe. An event could indicate an error condition or it could be informative to user 
software. There are two single bit registers (not shown) are associated with each 
interrupt or event. These are the condition event register and the condition mask 
register. . - ' 

The condition event register is a one bit read/write register whose value is set to 
one by a condition occurring within the circuit. The register is set to one even if the 
condition only existed transiently. The condition event register is then guaranteed to 
-remain set to one until the user's software resets it, or the entire chip is reset. The 
condition event register is cleared to zero by writing the value one. Writing zero to the 
condition event register leaves the register unaltered. The condition event register must 
be set to zero by user software before another occurrence of the condition can be 
observed. - 

The condition mask register is a one bit read/write register which enables the 
generation of an interrupt request if the corresponding condition event register is set. If 
the condition event is already set when 1 is written to the condition mask register an 
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interrupt request will be generated immediately. The value 1 enables interrupts. The 
condition mask register clears to zero on chip reset. Unless stated otherwise a block will 
stop operation after generating an interrupt request and will restart soon after either the 
condition event register or the condition mask register are cleared. 

Event bits and mask bits are always grouped into corresponding bit positions in 
consecutive bytes in the register map. This allows interrupt service software to use the 
value read from the mask registers as a mask for the value in the event registers to 
identify which event generated the interrupt. There is a single global event bit that 
summarizes the event activity on the chip. The chip event register presents the OR of 
all the on-chip events that have 1 in their respective mask bit. A value of 1 in the chip 
masf 'bit allows the chip to generate interrupts. A value of 0 in the chip mask bit 
prevents any on-chip events from generating interrupt requests. Writing 1 or 0 to the 
chip event register has no effect. The chip event register only clears when all the events 
enabled by a 1 in their respective mask bits have been cleared. 

The IRQ signal 642 is asserted if both the chip event bit and the chip event mask 
are set. The IRQ signal 642 is an active low, "open collector" output which requires an 
off-chip pull-up resistor. When active the IRQ output is pulled down by an impedance 
'of 1 00Q or less. A pull-up resistor of approx. 4kO is suitable. 

The input and output signals of the microprocessor interface 142 are described in 
tables 34 and 35 respectively. 
System Controller 

The system controller 1 98 (Fig. 15), which controls the operation of the multicarrier 
digital receiver 126 (Fig. 12), in particular channel acquisition and the ; handling of error 
conditions, is shown in further detail in Fig. 56. 

'" Referring to the state diagram in Fig. 57, the channel acquisition sequence is 

driven by four timeouts. 

(1) AGC acquisition timeout. 20 ms (80 symbols) are allowed for the AGC to bring 
up the signal level, shown in step 644. Then the FFT window is enabled to start 
acquisition search in block 646. 

(2) Symbol acquisition timeout: 200 symbol periods, the maximum guard interval 
plus active symbol length, is allocated to acquire the FFT. windowjn step 648. Another 
35 symbol periods are allocated to pilot location in step 650. Approximately 50 ms are 
required to process 2K OFDM symbols. An option is provided to exit step 650 as soon 
as the pilots have been located to save acquisition time in non-extreme situations. 

(3) Control Loop Settling timeout: A further 10 ms, representing approximately 40 
symbols is allocated to allow the control loops to settle in step 652. An option is provided 
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to exit step 652 and return to an initial step resync 654 if pilots have been; lost if control 
loop settling timeout occurs. 

(4) Viterbi synchronization timeout: In block 656 approximately 150 symbol periods 
are allocated for the worst case of tps synchronization, indicated by step 658 and 
5 approximately 100 symbol periods for the Viterbi Decoder 186 (Fig. 15) to synchronize 
to the transmitted puncture rate, shown as step 660. This is approximately 65 ms. In 
reasonable conditions it is unnecessary to wait this long. As soon as Viterbi synchroni- 
zation is established, then transition to the system Jock state 662. It is possible to 
bypass the tps synchronization requirement by setting parameters (see table below) in 
10 the receiver parameters register and setting set_rx_parameters to 1 . 

If acquisition fails at any stage, the process automatically returns to .step resync 
654 for retry. 

Having acquired lock, the system will remain in lock unless a Reed-Solomon 
overload event occurs, i.e. the number of Reed-Solomon packets with uncorrectable 
15 . errors exceeds a predetermined value (the rsojimit value) in any 1 second period. If 
any of the 4 synchronizing state machines in the acquisition sequence, FFT window 
(step 648), pilot locate (step 650), tps synchronization (step 658) and Viterbi synchroni- 
zation (step 660), lose synchronization once channel acquisition has occurred, ho action 
will be taken until an event, rso^event, occurs and the step resync 654 is triggered 
20 automatically. 

In poor signal conditions acquisition may be difficult, particularly the Viterbi 
synchronization. Therefore a bit is optionally provided in the microprocessor interface 
142 ( Fig. 12), which when set extends the timeouts by a factor of 4. > 

The input and output signals, and the microprocessor interface registers of the 
25 system controller 198 are described in tables 36, 37, 38, and 39 respectively. 



Tables 



Pin Name 


I/O 


Description 


Tuner/ADC Interface 


SCLK 


o 


Sample clock for ADC- 


IDATA[9:0] 


I 


Input ADC data bus (10-bit) 


AGC 


o 


Automatic Gain Control to 
tuner(Sigma-Delta output) 


XTC[2:0] 


o 


External Tuner Control Outputs 


MPEG-2 Transport Interface 
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While this invention has been explained with reference to the structure disclosed 
herein, it is not confined to the details set forth and this application is intended to cover 
any modifications and changes as may come within the scope of the following claims: 
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CLAIMS 

1 1 . A digital receiver for multicarrier signals comprising: 

2 an amplifier accepting an analog multicarrier signal, wherein said multicarrier 

3 signal comprises a stream of data symbols having a symbol period T s , wherein the 

4 symbols comprise an active interval, a guard interval, and a boundary therebetween, 

5 said guard interval being a replication of a portion of said active interval; 

6 an analog to digital converter coupled to said amplifier; 

7 an l/Q demodulator for recovering in phase and quadrature components from 

8 data sampled by said analog to digital converter; 

9 an automatic gain control circuit coupled to said analog to digital converter for 

10 providing a gain control signal for said amplifier; 

11 a low pass filter circuit accepting I and Q data from said l/Q demodulator, wherein 

12 said I and Q data are decimated; 

13 a resampling circuit receiving said decimated I and Q data at a first rate and 

14 outputting resampled I and Q data at a second rate; 

15 an £FT win dow synchronization ^ circuit coupled to said resampling circuit for 

16 locating a boundary of said guard interval; 

17 a real-time pipelined FFT processor operationally associated with said FFT 

18 window synchronization circuit, wherein said FFT processor comprises. at least one 

19 stage, said stage comprising: 

20 a com£le)c^oeffi^ 

21 -a memory having a lookup table defined therein for multiplicands being 

22 multiplied in said complex coefficient multiplier, a value of each said multiplicand 

23 being unique in said lookup table; and 

24 a monitor circuit responsive to said FFT window synchronization circuit for 

25 detecting a predetermined event, whereby said event indicates that a boundary between 

26 an active symbol and a guard interval has been located. 

1 2 r The receiver according to claim 1 , wherein said FFT window synchronization 

2 circuit comprises: 

3 a first delay element accepting currently arriving resampled I and Q data, and 

4 outputting delayed resampled I and Q data; 

5 a subtracter, for producing a difference signal representative of a difference 

6 between said currently arriving resampled I and Q data and said delayed resampled I 

7 and Q data; 
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8 a first circuit for producing an output signal having a unipolar magnitude that is 

9 representative of said difference signal of said subtracter; 

1 0 a second delay element for storing said output signal of said first circuit; 

1 1 a third delay element receiving delayed output of said second delay element; and 

12 a second circuit for calculating a statistical relationship between data stored in 

1 3 said second delay element and data stored in said third delay element and having an 

14 output representative of said statistical relationship. 

1 , 3. The receiver according to claim 2, wherein said statistical relationship 

2 comprises an F ratio. 

1 ^4; The receiver according to claim 1 , wherein said FFT processor operates in an 

2 8K mode. 

1 5. The receiver according to claim 1 , wherein said wherein said FFT processor 

2 further comprises an address generator-for said memory, said address generator 

3 accepting a signal representing an order dependency of a currently required multipii- 

4 .cand, and outputting an address of said memory wherein said currently required 

5 multiplicand is stored. 

1 6. The receiver according to claim 5, wherein each said multiplicand is stored in 

2 said lookup table in order of its respective order dependency for multiplication by said 

3 complex coefficient multiplier, said order dependencies of said multiplicands defining 

4 an incrementation sequence, and said address generator comprises: 

5 ;r~an accumulator for storing a previous address that was generated by said 

6 address generator; 

7 a circuit for calculating an incrementation .value of said currently required 

8 multiplicand; and 

9 an adder for adding said incrementation value to said previous address. 

1 7. The receiver according to claim 6, wherein said lookup table comprises a 

2 plurality of rows, and said incrementation sequence comprises a plurality of. 

3 incrementation sequences, said multiplicands being stored in row order, wherein 

4 in a first row a first incrementation sequence is 0; 

5 in a second row a second incrementation sequence is 1; 

6 in a third row first and second break points B1 , B2 of a third incrementation 

7 sequence are respectively determined by the relationships 
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B2 Mn = £4" 

n=0 

10 ; and 

11 in a fourth row a third break point B3 of a third incrementation sequence is 

12 ' determined by the relationship 
1 3 

B3 M = 2 x 4 N + 2 

14 wherein M N represents the memory of an Nth stage of said FFT processor. 

1 8. The receiver according to claim 1 , further comprising channel estimation and 

2 correction circuitry comprising: 

3 - pilot location circuitry receiving a transformed digital signal representing a frame 

4 from said FFT processor for locating pilot carriers therein, wherein said pilot carriers are 

5 spaced apart in a carrier spectrum of said transformed digital signal at intervals^ and 

6 have predetermined magnitudes, said pilot location circuitry comprising: - 

7 . ' a first-circuit for computing, an.. orderof carriers in said-transformed digital signal 

8 modulo K; 

9 K accumulators coupled to said second circuit for accumulating magnitudes of 

10 said carriers in said- transformed digital signal, said accumulated magnitudes defining 

1 1 a set; and 

12 a correlation circuit for correlating:K sets of accumulated magnitude values with 

13 said predetermined magnitudes, wherein a first member having : a position calculated 

14 modulo K in each of said K sets is uniquely offset from a start position of said frame. 

1 9. The receiver according to claim 8, wherein said pilot location circuitry further 

2 comprises a bit reversal circuit for reversing a bit order of said transformed digital signal. 

1 10. The receiver according to claim 7, wherein said magnitudes of said carriers 

2 and said predetermined magnitudes are amplitudes. 
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1 1 . The receiver according to claim 7 t wherein said magnitudes of said carriers 
and said predetermined magnitudes are absolute values. 
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1 12. The receiver according to claim 7, wherein said correlation circuitry further 

2 comprises a peak tracking circuit for determining a spacing between a first peak and a 

3 second peak of said K sets of accumulated magnitudes. 

1 13. The receiver according to claim 7, wherein said channel estimation and 

2 correction circuitry further comprises: 

3 an interpolating filter for estimating a channel response between said pilot 

4 carriers; and 

5 a multiplication circuit for multiplying data carriers output by said FFT processor 

6 with a correction coefficient produced by said interpolating filter. 

1 ~w -14. The receiver according to claim 7, wherein said channel estimation and 

2 correction circuitry further comprises 

3 a phase extraction circuit accepting a data stream of phase-uncorrected I and Q 

4 data from said FFT processor, and producing a signal representative of a phase angle 

5 of said uncorrected data, said phase extraction circuit including an accumulator for 

6 accumulating the phase angles of succeeding phase-uncorrected I and Q data. 

1 15. The receiver according to claim 1 4, said channel estimation and correction 

2 circuitry further comprises: 

3 an automatic frequency control circuit coupled to said phase extraction circuit 

4 and said accumulator, comprising; 

5 a memory for storing an accumulated common phase error of a first symbol 

6 carried in said phase-uncorrected I and Q data; 

7 ~; T wherein said accumulator is coupled to said memory and accumulates a 

8 difference between a common phase error of a plurality of pilot carriers in a second 

9 symbol and a common phase error of corresponding pilot carriers in said first symbol; 
10 an output of said accumulator being coupled to said l/Q demodulator. 

1 16. The receiver according to claim 15, wherein said coupled output of said 

2 accumulator is enabled~in said l/Q demodulator only during reception of a guard interval 

3 therein. 

1 17. The receiver according to claim 1 4, said channel estimation and correction 

2 circuitry further comprises an automatic sampling rate control circuit coupled to said 

3 phase extraction circuit, comprising: 
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4 a memory for storing accumulated phase errors of pilot carriers irf a first symbol 

5 carried in said phase-uncorrected I and Q data; 

6 wherein said accumulator is coupled to said memory and accumulates 

7 differences between phase errors of pilot carriers in a second symbol and phase errors 

8 of corresponding pilot carriers in said first symbol to define a plurality of accumulated 

9 : v intersymbol carrier phase error differentials, a phase slope being defined by a difference 

10 between a first accumulated intersymbol carrier phase differential and a second 

1 1 accumulated intersymbol carrier phase differential; 

12 an output of said accumulator being coupled to said l/Q demodulator. 

1 18. The receiver according to claim 17, wherein said sampling rate control circuit 

2 stores a plurality of accumulated intersymbol carrier phase error differentials and 

3 computes a line of best fit therebetween. 

1 19. The receiver according to claim 1 7, wherein said coupled output signal of said 

2 accumulator is enabled in said resampling circuit only during reception of^a guard 

3 . interval therein. - h 

1 20. The receiver according to claim 17, wherein a common memory for storing 

2 output of said phase extraction circuit is coupled to said automatic frequency control 

3 circuit and to said automatic sampling rate control circuit. 

1 * 21. The receiver according to claim 14, wherein-said phase extraction circuit 

2 further comprises: 

3 a pipelined circuit for iteratively computing the arctangent of an angle of rotation 

4 according to the series ~ r 

5 - - " 

tan-'(x) = x-^+ + . | X |<1 

3 5 7 9 

6 wherein x is a ratio of said phase-uncorrected I and Q data. 

1 22. The receiver according to claim 21 , wherein said pipelined circuit comprises: 

2 a constant coefficient multiplier; and 

3 a multiplexer for selecting one of a plurality of constant coefficients of said series, . 

4 an output of said multiplexer being connected to an input of said constant coefficient 

5 multiplier. 
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1 23. The receiver according to claim 21 , wherein said pipelined circuit comprises: 

2 a multiplier; 

3 a first memory for storing the quantity x 2 , said first memory being coupled to a 

4 first input of said multiplier; 

5 a second memory for holding an output of said multiplier; and 

6 a feedback connection between said second memory and a second input of said 

7 multiplier. 

1 ... 24. The receiver according to claim 21, wherein said pipelined circuit further 

2 comprises: 

3 :-?sa third memory for storing a value of said series; 

4 ,. a control circuit, coupled to said third memory, wherein said pipeline circuit 

5 computes N terms of said series, and said pipeline circuit computes N+1 terms of said 

6 series, wherein N is an integer; * 

7 ... an averaging circuit coupled to said' third memory for computing an average of 

8 said N terms and said N+1 terms of said series. 

1 25. The receiver according to claim 1 f wherein data transmitted in a pilot carrier 

2 of said multicarrier signal is BCH encoded according to a code generator polynomial 

3 h(x) f further comprising: 

4. a demodulator operative on said BCH encoded data; 

5 an iterative pipelined BCH decoding circuit, comprising: 

6 jssl. a circuit coupled to said demodulator for forming a Galois Field of said 

7 ^polynomial, and calculating a plurality of syndromes therewith; 

8 a plurality of storage registers, each said storage register storing a 

9 respective one of said syndromes; 

10 a plurality of feedback shift registers, each said feedback shift register 

1 1 ^ accepting data from a respective one of said storage registers and having an 

12 output; 

13 a plurality of Galois field multipliers, each said multiplier being connected 

14 in a feedback loop across a respective one of said feedback shift registers and 

15 multiplying the output of its associated feedback shift register by an alpha value 

1 6 of said Galois Field; 

1 7 an output Galois field multiplier for multiplying said outputs of two of said 

18 feedback shift registers; 
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19 an error detection circuit connected to said feedback shift; registers and 

20 said output Galois field multiplier, wherein an ouput signal of said error detection 

21 circuit indicates an error in a current bit of data; and . 

22 a feedback line enabled by said error detection circuit and connected to 

23 said storage registers, wherein outputs of said feedback shift registers are written 

24 into said storage registers. 

1 26. The receiver according to claim 25, wherein said output Galois field multiplier 

2 comprises: 

3 a first register initially storing a first multiplicand A; 

4 a constant coefficient multiplierrconnected to said register for multiplication by a 

5 value a, an output of said constant coefficient* multiplier beingxonnected to said first 

6 register to define a first feedback loop, whereby in a kth cycle of clocked operation said 

7 first register contains a Galois field product Aa k ; 

8- a second register for storing a second multiplicand B; 

9 an- AND gate connected, to said second register and to said output of said 

10 constant coefficient multiplier; 

11 an adder having a first input connected to an output of said AND gate; 

12 an accumulator connected to a second input of said adder; wherein an output of 

13 said adder is connected to said accumulator to define a second feedback loop; 

14 whereby a Galois field product AB is output by said adder. 

1 27. A method for estimation of a frequency response of a channel, comprising 

2 the steps of: _ 

3 receiving from a channel a <multicarrier signal having a plurality of data carriers 

4 and scattered pilot carriers, said scattered pilot carriers being spaced apart at a first 

5 interval N and being transmitted at a power that differs from a transmitted power of said 

6 data carriers; 

7 converting said multicarrier signal to a digital representation thereof; 

8 performing a Fourier transform on. said digital representation of said multicarrier 

9 signal to generate a transformed digital signal; 

10 reversing a bit order of said transformed digital signal to generate a bit-order 

1 1 reversed signal; 

12 , cyclically accumulating magnitudes of carriers in said bit-order reversed signal 

13 in N accumulators; 

14 correlating said accumulated magnitudes with said power of said scattered pilot 

1 5 carriers; 
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16 responsive to said step of correlating, generating a synchronizing signal that 

17 identifies a carrier of said multicarrier signal. 

1 28. The method according to claim 27, wherein said step of accumulating 

2 magnitudes comprises the steps of: 

3 adding absolute values of a real component of said bit-order reversed signal to 

4 respective absolute values of imaginary components thereof to generate sums; 

5 respectively storing said sums in said accumulators. 

1 ... 29. The method according to claim 27, wherein said step of correlating said 

2 accumulated magnitudes further comprises the step of: 

3 identifying a first accumulator having a highest value stored therein representing 

4 a first carrier position. 

1 30. The method according to claim 29, wherein said step of correlating said 

2 accumulated magnitudes further comprises the steps of: 

3 identifying a second accumulator having a second highest value stored therein 

4 representing a second carrier position; and 

5 determining an interval between said first carrier position and said second carrier 

6 position. 

1 31 . The method according to claim 27, further comprising the steps of: 

2 comparing a position of a carrier of a first symbol in said bit-order reversed signal 

3 with a position of a carrier of a second symbol therein. 

1 32. The method according to claim 27, further comprising the steps of: 

2 interpolating between pilot carriers to determine correction factors for respective 

3 intermediate data carriers disposed therebetween; and 

4 respectively adjusting magnitudes of said intermediate data carriers according 

5 to said correction factors. 

1 33. The method according to claim 27, further comprising the steps of: 

2 determining a mean phase difference between corresponding pilot carriers of 

3 successive symbols being transmitted in said transformed digital signal; and 

4 generating a first control signal responsive to said mean phase difference; and 

5 responsive to said first control signal adjusting a frequency of reception of said 

6 multicarrier signal. 
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7 34. The method according to claim 33, further comprising the steps of: 

8 determining a first phase difference between a first data carrier of a first symbol 

9 in said transmitted data carrier and said first data carrier of a second symbol therein; 

1 0 determining a second phase difference between a second data carrier of "'said first 

1 1 symbol and said second data carrier of said second symbol; and 

12 - determining a difference between said first phase difference and said second 

1 3 phase difference to define a phase slope between said first data carrier and said second 

1 4 data carrier; 

15 generating a second controLsignal responsive to said phase slope; and 

1 6 responsive to said second control signal adjusting a sampling frequency of said 

17 multicarrier signal. . ~ " ' : 

18 35. The method -according to claim 34, wherein said step of determining a"' 

19 difference between said first phase difference and said second phase difference 

20 comprises computing a line of best fit. 
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